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Abstract. A suite of vast stellar surveys mapping the Milky Way, culminating 
in the Gaia mission, is revolutionizing the empirical information about the distri- 
bution and properties of stars in the Galactic stellar disk. We review and lay out 
what analysis and modeling machinery needs to be in place to test mechanism of 
disk galaxy evolution and to stringently constrain the Galactic gravitational po- 
tential, using such Galactic star-by-star measurements. We stress the crucial role 
of stellar survey selection functions in any such modeling; and we advocate the 
utility of viewing the Galactic stellar disk as made up from 'mono-abundance 
populations' (MAPs), both for dynamical modeling and for constraining the 
Milky Way's evolutionary processes. We review recent work on the spatial and 
kincmatical distribution of MAPs, and lay out how further study of MAPs in 
the Gaia era should lead to a decisively clearer picture of the Milky Way's dark 
matter distribution and formation history. 
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1. Introduction 

In the world of galaxies, the Milky Way is in many ways as typical as it gets: 
half of the Universe's present-day stars live in galaxies that match our Milky 
Way in stellar mass, size, chemical abundance, etc. within factors of a few (e.g., 
Mo et al. 2010). But for us, it is the only galaxy whose stellar distribution we 
can see in its full dimensionality: star-by-star we can obtain 3D positions and 3D 
velocities (vios, ^-i, fJ-b), coupled with the stars' photospheric clement abundances 
and constraints on their ages. We know that in principle this enormous wealth 
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of information about the stellar body of our Galaxy holds a key to recognizing 
and understanding some of the mechanisms that create and evolve disk galaxies. 
And it holds a key to mapping the three-dimensional gravitational potential 
and by implication the dark matter distribution in the Milky Way. A sequence 
of ongoing photometry and spectroscopy surveys has recently hundred-folded 
the number of stars with good distances, radial and transverse velocities, and 
abundance estimates; this only forebodes the data wealth expected from ESA's 
flagship science Mission Gaia to be launched next year. 

Yet, practical approaches to extract the enormous astrophysical information 
content of these data remain sorely underdeveloped. It is not even qualitatively 
clear at this point what will limit the accuracy of any galaxy-formation or dark- 
matter inferences: the sample sizes, the fraction of the Milky Way's stellar body 
covered (cf. Fig. 1), the precision of the x- v phase-space measurements, the 
quality and detail of the abundance information, or the (lack of) stellar age 
estimates. Are dynamical analyses limited by the precision with which sample 
selection functions can be specified, or by the fact that dust obscuration and 
crowding will leave the majority (by stellar-mass-weighted volume) of Milky 
Way stars unobserved even if all currently planned experiments worked our per- 
fectly? Or are dynamical inferences limited by the fact that the symmetry and 
equilibrium assumptions, which underlie most dynamical modeling, are only ap- 
proximations? 

The ongoing data deluge and the continuing recognition how much our Milky 
Way, despite being 'just one galaxy', may serve as a Rosetta Stone for galaxy 
studies, have triggered a great deal of preparatory work on how to analyze and 
model these data. Many of the scientific and practical issues have been laid 
out, e.g., in Turon et al. (2008) or Binney (2011). Yet, it appears (at least to 
the authors) that the existing survey interpretation and modeling approaches 
are still woefully inadequate to exploit the full information content of even the 
existing Galactic stellar surveys, let alone the expected information content of 
Gaia data. 

And while the science theme of 'understanding the current structure of the 
Galaxy and reconstructing its formation history' remains of central interest to 
astrophysics, the specific questions that should be asked of the data have evolved 
since the original Gaia mission science case was laid out, through advances in 
our understanding of galaxy formation in the cosmological framework and re- 
cent work on secular galaxy evolution. The aim of simply 'reconstructing' the 
formation history of the Galaxy in light of even idealized Gaia data, now seems 
naive. 

The review here will restrict itself to the Galactic stars (as opposed to the 
Galactic interstellar medium), and in particular to the Milky Way's dominant 
stellar component, its Disk, which contains about three quarters of all Galactic 
stars. As a shorthand, we will use the term Disk (capitalized) to refer to the 
'Milky Way's stellar disk'. All other disks will be labeled by qualifying adjectives. 
The discussion will focus on the stellar disk of our Galaxy and dark matter in the 
cenira/ parts of the overall Milky Way halo, (< 0.05 x i?viriai ^ 12kpc), with only 
cursory treatment of the other stellar components, the gas and dust in the Milky 
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Fig. 1 A view of our Galaxy and the effective volume that Gaia will survey 
(courtesy X. Luri & A. Robin), based on current simulations of Gaia mock 
catalogs. Even in the age of Gaia, dust extinction and image crowding will limit 
the exploration of the Disk to only a quadrant with optical surveys. 
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Way, and the overall halo structure. Perhaps our neglect of the Galactic bar is 
most problematic in drawing scope boundaries for this review, as the interface 
between disk and bar is both interesting and unclear, and because it is manifest 
that the present-day Galactic bar has an important dynamical impact on the 
dynamics and evolution of the Disk; yet, the Galactic bar's role and complexity 
warrants a separate treatment. 

The questions about galaxy disk formation that a detailed analysis of the 
Milky Way may help answer are manyfold. What processes might determine 
galaxy disk structure? In particular, what processes set the exponential radial 
and vertical profiles seen in the stellar distributions of galaxy disks? Were all or 
most stars born from a well-settled gas disk near the disk plane and acquired their 
vertical motions subsequently? Or was some fraction of disk stars formed from 
very turbulent gas early on (e.g., Bournaud et al. 2009; Ceverino et al. 2012), 
forming a primordial thick disk? Are there discernible signatures of the stellar en- 
ergy feedback to the interstellar medium that global models of galaxy formation 
have identified as a crucial 'ingredient' of (disk) galaxy formation (Nath & Silk 
2009; Hopkins et al. 2012)? What was the role of internal heating in shaping 
galaxy disks? What has been the role of radial migration (Sellwood & Binney 
2002; Roskar et al. 2008a,b; Schonrich & Binney 2008; Minchev et al. 2011), i.e., 
the substantive changes in the stars' mean orbital radii that are expected to oc- 
cur without boosting the orbits' eccentricity? What was the disk-shaping role 
of minor mergers (e.g., Abadi et al. 2003), which are deemed an integral part of 
the ylCDM cosmogony? How much did in-falling satellites impulsively heat the 
Milky Way's disk, potentially leading to a distinct thick disk (Villalobos & Helmi 
2008)? How much stellar debris did they deposit in this process? Is the radial 
orbit migration induced by satellite infall (Bird et al. 2012) distinguishable from 
purely internal processes? All of these questions are not only relevant for the 
Milky Way in particular, but lead generically to the question of how resilient 
stellar disks are to tidal interactions; it has been claimed (e.g., Kormendy et al. 
2010; Shen et al. 2010) that the existence of large, thin stellar disks poses a 
challenge to the merger-driven ylCDM picture. 

In the end, answers to these questions require a multi-faceted comparison of 
the Disk's observable status quo to the expectations from ab initio formation 
models, in practice for the most part hydrodynamical simulations. However, the 
current generation of 'cosmological' disk galaxy formation models is more illus- 
trative than exhaustive in their representation of possible disk galaxy formation 
histories; therefore the question of how to test for the importance of galaxy 
formation ingredients through comparison with observational data is actively 
underway. 

The maximal amount of empirical information about the Disk that one can 
gather from data is a joint constraint on the gravitational potential in which 
stars orbit <P{x^t) and on the chemo-orbital distribution function of the Disk's 
stars. How to best obtain such a joint constraint is a problem solved in principle, 
but not in practice (see Sections 2.3 & 5 ). 

It is in this context that this review sets out to work towards three broad 
goals: 
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— synthesize what the currently most pertinent questions about dark matter, 
disk galaxy formation and evolution are that may actually be addressed with 
stellar surveys of the Galactic disk. 

— lay out what 'modeling' of large stellar samples means and emphasize some 
of the practical challenges that we see ahead. 

— describe how recent work may change our thinking about how to best address 
these questions. 

Compared to Turon et al. (2008) and Ivezic, Beers & Juric (2012) for an 
empirical description of the Milky Way disk, and compared to a series of pa- 
pers by Binney and collaborators on dynamical modeling of the Galaxy (Binney 
2010; Binney & McMiUan 2011; Binney 2012; McMillan & Binney 2012), we 
place more emphasis two aspects that we deem crucial in Galactic disk mod- 
eling: 

— the consideration of 'mono-abundance' stellar sub-populations (MAPs), ask- 
ing the question of 'what our Milky Way disk would look like if we had eyes 
for stars of only a narrow range of photospheric abundances'. The importance 
of 'mono-abundance components' arises from the fact that in the presence of 
significant radial migration, chemical abundances are the only life-long tags 
(see Freeman & Bland-Hawthorne 2002) that stars have, which can be used 
to isolate sub-groups independent of presuming a particular dynamical his- 
tory. In a coUisionless system, such populations can be modeled completely 
independently; yet they have to 'feel' the same gravitational potential. 

— the central importance of the (foremost spatial) selection function of any 
stellar sample that enters modeling. Dynamical modeling links the stars' 
kinematics to their spatial distribution. Different subsets of Disk stars (dif- 
fering e.g. by abundance) have dramatically different spatial and kinematical 
distributions. If the spatial selection function of any sub-set of stars with 
measured kinematics is not known to better than some accuracy, this will 
pose a fundamental limitation on dynamical inferences, irrespective of how 
large the sample is; with ever larger samples emerging, understanding the 
selection function is increasingly probably to be a limiting factor in the anal- 
ysis. 

The remainder of the review is structured as follows: in Section 2 we discuss 
in more detail the overall characterization of the Milky Way's disk and detail the 
open questions of stellar disk formation and evolution in a cosmological context. 
In Section 3 we provide an overview of the existing and emerging stellar Galactic 
surveys and in Section 4 we describe how the survey selection function can and 
should be rigorously handled in modeling; . In Sections 5 and 6 we present recent 
results in dynamical and structural modeling of the Disk, and their implications 
for future work. In the closing Section 7, we discuss what we deem the main 
practical challenges and promises for this research direction in the next years. 
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2. Galactic Disk studies: an overview 

'Understanding' the Disk could mean having a comprehensive empirical charac- 
terization for it and exploring which - possibly competing - theoretical concepts 
that make predictions for these characteristics match, or do not. As the Milky 
Way is only one particular galaxy and as disk galaxy formation is a complex 
process that predicts broad distributions for many properties, it may be useful 
to consider which Disk characteristics generically test formation concepts, rather 
than simply representing one of many possible disk formation outcomes. 

2.1. Characterizing the current structure of the Disk 

In a casual, luminosity or mass-weighted average, the Disk can be char- 
acterized as a highly flattened structure with an (exponential) radial scale 
length of 2.5 — 3kpc and scale height of ~ 0.3 kpc (e.g., Kent et al. 1991; 
Lopez-Corredoira et al. 2002; McMillan 2011), which is kinematically cold in 
the sense that the characteristic stellar velocity dispersions near the Sun of 
cr^ ~ cr^ ~ (j/j/1.5 ~ 25 km s~^ are far less than Vcm — 220km s~^. Current 
estimates for the overall structural parameters of the Milky Way are compiled 
in Table 1.2 of Binney & Tremaine (2008); more specifically estimates for the 
mass of the Disk are ~5 x 10^° M© (Flynn et al. 2006; McMillan 2011), though 
the most recent data sets have not yet been brought to bear on this basic 
number. No good estimates for the globally averaged metallicity of the Disk 
exist, though ([Fe/H]) being about at the solar value seems likely. 

With these bulk properties, the Milky Way and its Disk are very 'typical' in 
the realm of present-day galaxies: comparable numbers of stars in in the low- 
redshift Universe live in galaxies larger and smaller (more and less metal-rich) 
than the Milky Way. For its stellar mass, the structural parameters of the Disk 
are also not exceptional (e.g., van der Kruit & Freeman 2011). Perhaps the most 
unusual aspect of the Milky Way is that its stellar disk is so dominant, with a 
luminosity ratio of bulge-to-disk of about 1:5 Kent et al. (1991): most galaxies 
of M* > 5 X 10^° Mq are much more bulge dominated (Kauffmann et al. 2003). 

But describing the Disk by 'characteristic' numbers, as one is often forced to 
do in distant galaxies, does not even begin to do justice to the rich patterns that 
we see in the Disk: it has been long established that positions, velocities, chemical 
abundances, and ages are very strongly and systematically correlated. This is in 
the sense that younger and/or more metal rich stars tend to be on more nearly 
circular orbits with lower velocity dispersions. Of course, stellar populations with 
lower (vertical) velocity dispersions will form a thinner disk component. This 
has led to the approach of defining sub-components of the Disk on the basis of 
the spatial distribution, kinematics, or chemical abundances. Most common has 
been to describe the Disk in terms of a dominant thin disk and a thick disk, with 
thin-thick disk samples of stars defined spatially, kinematically, or chemically. 
While these defining properties are of course related, they do not isolate identical 
subsets of stars. Whether it is sensible to parse the Disk structure into only two 
distinct components is discussed below. 
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Fig. 2 Stellar number density map of the Disk and halo from Juric et al. (2008), 
for K-stars with colors 0.6 < i < 0.65. The map, averaged over the ^-direction 
was derived drawing on SDSS photometry, and applying photometric distance 
estimates that presume (sensibly) that the vast majority of stars are on the 
main sequence, not giants. Note that for these colors the main sequence stars 
will sample all ages fairly, as their MS luminosity remains essentially unchanged. 
With increasing \z\ and i?, however, the mean metallicity of the stars changes, 
and the stars within that color range represent different masses and luminosities. 

Much of what we know about these spatial, kinematical, and chemical cor- 
relations within the Disk has come until very recently from very local sam- 
ples of stars, either from studies at i? ~ i?© or from the seminal and pivotal 
Hipparcos/Geneva-Copenhagen sample of stars drawn from within ~ 100 pc 
(ESA 1997; Nordstrom et al. 2004). As dynamics links local and global prop- 
erties, it is perfectly possible and legitimate to make inferences about larger 
volumes than the survey volume itself; yet, it is important to keep in mind that 
the volume-limited Geneva- Copenhagen sample encompasses a volume that cor- 
responds to two-millionths of the Disk's half-mass volume. Only recently have 
extensive samples beyond the solar neighborhood with p(a5, t), [X/H]) become 
available. 
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Fig. 3 Distribution in f^ — u^-space of very nearby stars in the Disk with distances 
from Hipparcos (~100pc; Dehnen 1998). The sample shows rich sub-structure 
of 'moving groups', some of which reflect stars of common birth origin, others 
are the result of resonant orbit trapping. 

While a comprehensive empirical description of the Disk (spatial, kinemat- 
ical, abundances) in the immediate neighborhood of the Sun has revealed rich 
correlations that need explaining, an analogous picture encompassing a sub- 
stantive fraction of the disk with direct observational constraints is only now 
emerging. 

But the Disk is neither perfectly smooth nor perfectly axisymmetric, as the 
above description implied. This is most obvious for the youngest stars that still 
remember their birthplaces in star clusters and associations and in spiral arms. 
But is also true for older stellar populations. On the one hand, the spiral arms 
and the Galactic bar arc manifest non-axisymmetric features, with clear signa- 
tures also in the Solar neighborhood (e.g., Dehnen 1998, 2000; Fux 2001; Quillen 
2003; Quillen & Minchev 2005). By now the strength of the Galactic bar and its 
pattern speed seem reasonably well established, both on the basis of photome- 
try (Binney et al. 1997; Bissantz & Gerhard 2002) and on the basis of dynamics 
(Dehnen 1999; Minchev et al. 2007). But we know much less about spiral struc- 
ture in the stellar disk. While the location of the nearby Galactic spiral arms 
have long been located on the bases of the dense gas geometry and distance mea- 
surements to young stars, the existence and properties of dynamically-important 
stellar spiral structure is completely open: neither has there been a sound mea- 
sure of a spiral stellar over-density that should have dynamical effects, nor has 
there been direct evidence for any response of the Disk to stellar spirals. Clar- 
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ifying the dynamical role of spiral arms in the Milky Way presumably has to 
await Gaia. 

In addition there is a second aspect of non-axisymmctric sub-structure, which 
is known to be present in the Disk, but which is far from being sensibly charac- 
terized: there are groups of chemically similar stars on similar but very unusual 
orbits (streams, e.g., Helmi et al. 1999; Navarro et al. 2004; Klement et al. 2008, 
2009) that point towards an origin where they were formed in a separate satel- 
lite galaxy and subsequently deposited in the Disk in a (minor) merger. The 
process of Disk heating and Disk augmentation through minor mergers has been 
simulated extensively, both with collisionless and with hydrodynamical simula- 
tions (e.g., Velazquez & White 1999; Abadi et al. 2003; Kazantzidis et al. 2008; 
Moster et al. 2010): these simulations have shown that galaxy disks can ab- 
sorb considerably more satellite infall and debris than initial estimates had sug- 
gested (Toth & Ostriker 1992). However, these simulations also indicated that — 
especially for prograde satellite infall — it is not always easy to recognize satellite 
debris by its orbit, once it has been incorporated into the disk. 

2.2. The formation and evolution of the Disk 

Explaining the formation and evolution of galaxy disks has a 50-year history, 
gaining prominence with the seminal papers by Mestel (1963) and Eggen et al. 
(1962). Yet, producing a galaxy through ab initio calculations that in its disk 
properties resembles the Milky Way has remained challenging to this day. 

Exploring the formation of galaxy disks through simplified (semi-)analytic 
calculations has yielded seemingly gratifying models, but at the expense of ig- 
noring 'detail' that is known to play a role. Milestones in this approach were 
in particular the work by Fall & Efstathiou (1980) that put Mestel's idea of gas 
collapse under angular momentum conservation into the cosmological context 
of an appropriately sized halo that acquired a plausible amount of angular mo- 
mentum through interactions with its environment: this appeared as a cogent 
explanation for galactic disk sizes. These concepts were married with the Press 
& Schechter (Press & Schechter 1973; Bond et al. 1991) formalism and its ex- 
tensions by Mo et al. (1998), to place disk formation in the context of halos that 
grew by hierarchical merging. This approach performed well in explaining the 
overall properties of the present-day galaxy disk population and also its redshift 
evolution (e.g., Somerville et al. 2008). 

Yet, trouble in explaining disks came with the efforts to explain galaxy disk 
formation using (hydrodynamical) simulations in a cosmological context. The 
first simulations (Katz & Gunn 1991), which started from unrealistically sym- 
metrical and quiescent initial conditions, yielded end-products that resembled 
observed galaxy disks. But then the field entered a 15 year period in which al- 
most all simulations produced galaxies that were far too bulge-dominated and 
whose stellar disks were either too small, or too anemic (low mass fraction), or 
both. Of course, it was clear from the start that this was a particularly hard 
problem to tackle numerically: the initial volume from which material would 
come (~ 500 kpc) and the thin-ness of observed disks (~ 0.25 kpc) implied very 
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Fig. 4 Output of the ERIS disk galaxy formation simulation aimed at following 
the formation history of a Milky Way like galaxy ab initio. The Figure shows a 
projection resembling the 2MASS map of the Galaxy, and shows that simulations 
have now reached a point where, with very quiescent merging histories, disk- 
dominated galaxies can result. 

large dynamic ranges, and the 'sub-grid' physics issue of when to form stars from 
gas and how this star-formation would feed back on the remaining gas played a 
decisive role. 

As of 2012. various groups (Agertz et al. 2011; Guedes et al. 2011; 
Martig et al. 2012; Stinson et al. 2013) have succeeded in running simula- 
tions that can result in large, disk-dominated galaxies, resembling the Milky 
Way in many properties (cf. Figure 4). This progress seems to have — in good 
part — arisen from the explicit or implicit inclusion of physical feedback pro- 
cesses that had previously been neglected. In particular, 'radiative feedback' or 
'early stellar feedback' from massive stars before they explode as Supernovae 
seems to have been an important missing feedback ingredient and must be 
added in simulations to the well-established supernovae feedback (Nath & Silk 
2009; Hopkins et al. 2012; Brook et al. 2012). With such — physically expected 
— feedback implemented, disk dominated galaxies can be made in ab initio 
simulations that have approximately correct stellar-to-halo mass fractions for a 
wide range of mass scales. In some of the other cases, cf. Guedes et al. (2011), 
the radiative feedback is not implemented directly, but cooling-suppression 
below 10^ K has a similar effect. It appears that the inclusion of this additional 
feedback not only makes disk dominated galaxies a viable simulation outcome, 
but also improves the match to the galaxy luminosity function and Tully-Fisher 
relations. 

This all constitutes long-awaited success, but is certainly far from having any 
definitive explanation for the formation of any particular galaxy disk, including 
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ours. In particular, a very uncomfortable dependence of the simulation outcome 
on various aspects of the numerical treatment remains, which has recently been 
summarized by Scannapieco et al. (2012). 

In addition to the fully cosmological simulations, a large body of work has 
investigated processes relevant to disk evolution from an analytic or numeri- 
cal perspective, with an emphasis on chemical evolution and the formation and 
evolution of thick-disk components. The chemical evolution of the Milky Way 
has been studied with a variety of approaches (e.g., Matteucci & Francois 1989; 
Gilmore et al. 1989; Chiappini et al. 2001) mostly with physically motivated but 
geometrically simplified models (not cosmological ab initio simulations): these 
studies have aimed at constraining the cosmological infall of 'fresh' gas (e.g., 
Fraternali & Binney 2008; Colavitti et al. 2008), explain the origin of radial 
abundance gradients (e.g., Prantzos & Aubert 1995), and explore the role of 
'galactic fountains', i.e. gas blown from the disk, becoming part of a rotating 
hot corona, and eventually returning to the disk (e.g., Marinacci et al. 2011). 
Recent work by Minchev et al. (2012b) has introduced a new approach to mod- 
eling the evolution of the Disk by combining detailed chemical evolution models 
with cosmological TV-body simulations. 

To explain in particular the vertical disk structure qualitatively different 
models have been put forward, including cosmologically-motivated mechanisms 
where stars from a disrupted satellite can be directly accreted (Abadi et al. 
2003), or the disk can be heated through minor mergers (Toth & Ostriker 
1992; Quinn et al. 1993; Kazantzidis et al. 2008; Villalobos & Helmi 2008; 
Moster et al. 2010) or experience a burst of star formation following a gas- 
rich merger (Brook et al. 2004). Alternatively, the disk could have been born 
with larger velocity dispersion than is typical at z « (Bournaud et al. 
2009; Ceverino et al. 2012), or purely internal dynamical evolution due to 
radial migration may gradually thicken the disk (Schonrieh & Binney 2009; 
Loebman et al. 2011; Minchev et al. 2012a). It is likely that a combination 
of these mechanisms is responsible for the present-day structure of the Disk, 
but what relative contributions of these effects should be expected has yet to 
be worked out in detail, and none of these have been convincingly shown to 
dominate the evolution of the Disk. 

Radial migration is likely to have a large influence on the observable prop- 
erties of the Disk, if its role in shaping the bulk properties of the disk were 
sub-dominant. The basic process as described by Sellwood & Binney (2002) con- 
sists of the scattering of stars at the corotation radius of transient spiral arms; 
at corotation such scattering changes the angular momentum, L^, of the orbit 
(~mean radius) without increasing the orbital random energy. A similar — and 
potentially more efficient — process was later shown to happen when the bar and 
spiral structure's resonances overlap (Minchev & Famaey 2010; Minchev et al. 
2011). Such changes in the mean orbital radii are expected to be of order unity 
within a few Gyrs, and to have a profound effect on the interpretation of the 
present-day structure of the Disk: e.g. present-day can no longer be used as a 
close proxy for the birth even for stars on near-circular orbits. Quantifying 
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the strength of radial migration in the Milky Way is one of the most pertinent 
action items for the next-generation of Milky Way surveys. 

A brief synthesis of the predictions from all these efforts is as follows: 

— Stellar disks should generically form from the inside out. More specifically, 
it is the low angular momentum gas that settles first near the centers of the 
potential wells forming stars at small radii. 

— Disk-dominated galaxies with disk sizes in accord with observations can 
emerge, if, and only if, there is no major merger since 2 ~1. 

— The luminosity- or mass- weighted radial stellar density profiles at late epochs 
resemble exponentials. 

— Stars that formed at earlier epochs {z > 1), when the gas fraction was far 
higher, do not form from gas disks that are as well-settled and thin as they 
are at z ~0 with dispersions of < 10km s^^. 

— Characteristic disk thicknesses or vertical temperatures of 400 pc and — 
25 km s~^ are plausible. 

— Material infall, leading to fresh gas supply and dynamical disk heating, and 
star formation is not smooth but quite variable, even episodic, with a great 
deal of variation among dark matter halos with the same overall properties. 
It is rare that one particular infall or heating event dominates. 

— Throughout their formation histories galaxies exhibit significant non- 
axisymmetries, which at epochs later than z ~ 1 resemble bars and spiral 
arms. Through resonant interactions, these structures may have an important 
influence on the evolution of stellar disks. 

2.3. The Disk and the Galactic gravitational potential 

Learning about the orbits of different Disk stellar sub-populations as a galaxy 
formation constraint and learning from these stars about the gravitational po- 
tential, 4>{x,t), are inexorably linked. Those orbits are generally described by 

a chemo-orbital distribution function, 0, [X/H], iagol^^(ic)) that quantifies 
the probability of being on an orbit labeled by J, cp for each subset of stars 

(characterized by, e.g., by their ages tagc or their chemical abundances [X/H] = 
[Fe/H], [a/Fe], . . .). Here, we chose to characterize the orbit (the argument of 
the distribution function) by J, actions or integrals of motion (which depend 
on both its observable instantaneous phase-space coordinates p{x,v) and on 
(l>{x,t)). Each star then also has an orbital phase (or angle), 0, whose distribu- 
tion is usually assumed to be uniform in [0, 2tt]. 

Unless direct accelerations are measured for stars in many parts of the 
Galaxy, many degenerate combinations of <P{x,t) and p( J, </>, [X/H], tagc) ex- 
ist, unless astrophysically-plausible constraints and/or assumptions are imposed: 
time-independent steady-state solutions, axisymmetric solutions, uniform phase 
distributions of stars on the same orbit, etc. This is the art and craft of stellar 
dynamical modeling (Binney & Tremaine 2008; Binney & McMillan 2011). 

Learning about the Galactic potential is one central aspect of such dynam- 
ical modeling. On scales larger than individual galaxies, the so-called standard 
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jICDM cosmology has been tremendously successful in its quantitative predic- 
tions. If certain characteristics for dark energy and dark matter are adopted 
the large-scale matter and galaxy distribution can be well explained (includ- 
ing baryon acoustic oscillations) and be linked to the fluctuations in the cosmic 
microwave background. On the scales of galaxies and smaller, theoretical predic- 
tions are more complex, both because all of these scales are highly non-linear and 
because the cooling baryons constitute an important, even dominant mass com- 
ponent. Indeed, the ylCDM paradigm seems to make at least two predictions that 
arc unsubstantiated by observational evidence, or even in seeming contradiction. 
Not only are numerous low-mass dark-matter halos predicted, almost completely 
devoid of stars (Kauffmann et al. 1993; Klypin et al. 1999; Moore et al. 1999), 
ylCDM simulations also predict that dark matter profiles are cuspy, i.e., have di- 
vergent dark matter densities towards their centers (Dubinski & Carlberg 1991; 
Navarro et al. 1996). In galaxies as massive as our Milky Way, baryonic processes 
could not easily turn the dark matter cusp into a core, as seems viable in low 
mass galaxies (e.g., Flores & Primack 1993; Pontzen & Governato 2012). There- 
fore, we should expect for the Milky Way that more than half of the mass within 
a sphere of ~ Rq should be dark matter. Yet microlensing towards the Milky 
Way bulge (Popowski et al. 2005; Hamadache et al. 2006; Sumi et al. 2006) in- 
dicates that in our own Galaxy most of the in-plane column density is made up 
of stars (e.g., Binney & Evans 2001). Measurements in individual external galax- 
ies remain inconclusive, because dynamical tracers only measure the total mass, 
but cannot separate the stellar and DM contributions. In the Milky Way, almost 
all stellar mass beyond the bulge is in a stellar disk and hence very flat, while 
the DM halos emerging from ylCDM simulations are spheroidal or ellipsoidal. 
So, mapping the Milky Way's mass near the disk plane as a function of radius 
through the vertical kinematics will break the so-called disk — halo degeneracy, 
when combined with the rotation curve and the outer halo mass profile: one can 
then separate the flat from the round-ish mass contributions. 

Perhaps the most immediate goal of dynamical Disk modeling is to deter- 
mine how much DM there is within the Solar radius: as little as implied by the 
microlensing results, or as much as predicted by vlCDM cosmology. A second 
goal for the Galactic potential is to determine whether the disk-like total mass 
distribution is as thin as the stellar counts imply; this is interesting, because the 
possibility of a thick dark matter disk has been raised (Read et al. 2008). Finally, 
we can combine precise constraints from near the Galactic disk with constraints 
from stellar halo streams (e.g., Koposov et al. 2010) to get the shape of the po- 
tential as a function of radius. This can then be compared to the expectations, 
e.g., of alternative gravity models, and provide another, qualitatively new test 
for the inevitability of some form of dark matter. 

On the other hand, the best possible constraints on <P{x) are necessary to 
derive the (chemo-)orbital distribution of stars (the 'distribution function', DF), 
as the orbit characteristic, such as the actions depend of course on both {x, v) 
of the stars and on <l>{x). 
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3. Stellar surveys of the Milky Way 

3.1. Survey desiderata 

The ideal survey would result in an all-encompassing catalog of stars throughout 
the Disk, listing their 3-D positions and 3-D velocities (a;,t>), elemental abun- 
dances ([X/H]), individual masses (M*), ages (tagc), binarity, and line-of-sight 
reddening (Ay), along with the associated uncertainties. Furthermore, these 
uncertainties should be 'small', when compared to the scales on which the multi- 
dimensional mean number density of stars n{x, v, [X/H], M,, tago) has structure. 
In practice, this is neither achievable in the foreseeable future, nor is it clear 
that a 'fuller' sampling of n{x, v, [X/H], A/,, tagc) is always worth the additional 
effort. Such an all-encompassing su rvey^ would imply that the probability of en- 
tering the catalog ispcompicto((ic, v, [X/H], Af*, <agc) — 1 across the entire relevant 
domain-^ of x, v, [X/H], Af*, and tage- 

Any realistic survey is a particular choice of compromise in this parameter 
space. Fundamentally, ^complete of any survey, i.e., the probability of any given 
star having ended up in the catalog, is always limited by quantities in the space 
of 'immediate observables', foremost by the stars' fluxes (or signal-to-noise) vis- 
a-vis a survey's flux limit or image-crowding limit. But these 'immediate ob- 
servables' are usually not the quantities of foremost astrophysical interest, say 
x,v, [X/H], m, iago- While full completeness is practicable with respect to some 
quantities, e.g., the angular survey coverage {i,b) and the velocities v, it is not 
in other respects. The effective survey volume will always be larger for more 
luminous stars, the survey's distance limit will always be greater in directions of 
lower dust extinction at the observed wavelength; and an all-sky survey at flxed 
exposure time will go less deep towards the bulge because of crowding. 

So, in general, 'completeness' is an unattainable goal. And while samples 
that are 'complete' in some physical quantity such as volume or mass are imme- 
diately appealing and promise easy analysis, their actual construction in many 
surveys comes at the expense of discarding a sizable (often dominant) fraction 
of the pertinent catalog entries. With the right analysis tools, understanding the 
survey (in-)completeness, and its mapping into the physical quantities of inter- 
est, is more important than culling 'complete' samples. We will return to this in 
Section4. . 

3.2. Observable quantities and physical quantities of interest 



The physical attributes about a star {x, v, [X/H], m, iagc) that one would like to 
have for describing and modeling the Disk are in general not direct observables. 
This starts out with the fact that the natural coordinates for {x, v) in the space 
of observables are {£,b,D) and (vios, fi£, fih), respectively, with a heliocentric 
reference system in position and velocity. 

^ For the Disk this domain is, e.g., 1 < Dec < 20kpc, \v\ < 600km s^^, etc. 
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In general, Galaetie star surveys fall into two seemingly disjoint categories: 
imaging and spectroscopy. Imaging surveys get parsed into catalogs that provide 
angular positions and fluxes (typically in 2 to 10 passbands) for discrete sources, 
once photometric solutions (e.g., Schlafly et al. 2012) and astrometric solutions 
(Pier et al. 2003) have been obtained. Multi-epoch surveys, or the comparison of 
different surveys from different epochs then provide proper motions, and — with 
sufficient precision — useful parallaxes (e.g.. Ferryman et al. 1997; Munn et al. 
2004). Spectroscopic surveys are carried out with different instrumentation or 
even as disjoint surveys, usually based on a pre-existing photometric catalog. 
These spectra, usually for vastly fewer objects, provide vios on the one hand 
and the spectra that allow estimates of the 'stellar parameters', Teff, log g, and 
[X/H](e.g., Nordstrom et al. 2004; Yanny et al. 2009). But 'imaging' and 'spec- 
troscopic' surveys are only seemingly disjoint categories, because multi-band 
photometry may be re-interpreted as a (very) low resolution spectrum and some 
stellar parameters (e.g., Tgff and [Fe/H]) can be constrained by photometry 
and/or spectra (e.g., Ivezic et al. 2008). 

The practical link between 'observables' and 'quantities of interest' warrants 
extensive discussion: along with the survey selection function (Section 4) it is 
one of the two key ingredients for any rigorous survey analysis. For some quan- 
tities, such as {£, b) and vios this link is quite direct; but even there a coordinate 
transformation is required, which involves Rq, ^lsr and of course the distances, 
as most dynamical or formation models are framed in some rest-frame system 
with an origin at the Galactic center. Similarly, converting estimates of i-ie,fJ-b 
to two components of v (and Sv) requires knowledge of the distance. In Fig. 5, 
we show a graphical model overview over the task of 'astrophysical parameter 
determination' in Galactic surveys. In the subsequent sub-sections, we discuss 
different aspects of this model. 

3.2.1. Distance estimates 

Glearly, a 'direct' distance estimate, one that is independent of any intrinsic 
property of the stars, such as parallax measurements, is to be preferred. At 
present, good parallax distance estimates exist for about 20,000 stars within 
~ 100 pc from Hipparcos (Ferryman et al. 1997; ESA 1997). And a successful 
Gaia mission will extend this to ~ 10^ stars within ~10kpc (de Bruijne 2012). 
However, a widely usable catalog with Gaia parallaxes is still 5 years away as 
of this writing, and even after Gaia most stars in the already existing wide-field 
photometric surveys (e.g., SDSS or FSl) will not have informative Gaia parallax 
estimates, simply because they are too faint. 

Therefore, it is important to discuss the basics and the practice of (spectro- 
)photometric distance estimates to stars, which has turned out to be quite pro- 
ductive with the currently available data quality. Such distance estimates, all 
based on the comparison of the measured flux to an inferred intrinsic luminos- 
ity or absolute magnitude, have a well-established track record in the Galaxy, 
when using so-called standard candles, such as BHB stars (Sirko et al. 2004; 
Xue et al. 2008) or 'red clump stars' in near-IR surveys (e.g., Alves et al. 2002). 
But of course, as the 'physical HR diagram' (L vs. Tes) in Fig. 6 shows, the 
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Fig. 5 Constraining Stellar Parameters from Observables in Milky Way Surveys: 
this Figure provides a schematic overview, in the form of a simplified graphical 
model, of the logical dependencies between the stellar observables in Galactic 
surveys (thick dotted ovals) and the main desiderata for each star (thin dashed 
ovals) , its stellar parameters and distance, given various prior expectations about 
galaxy formation, star formation and the Galactic dust distribution (top). The 
basic observables are: line-of-sight-velocity, vios, proper motions, /x, parallax tt, 
multi-band photometry mx. and photospheric parameters derived from spectra 
(Teff, log g, abundances, Z); most of them depend on the Sun's position xq 
through. Ax. The main desiderata are the star's mass Af*, age ^agoand abun- 
dances Z, along with its distance D from the Sun and the (dust) extinction along 
the line of sight. Ay- The prior probabilities of Af,, age tage, Z, D and Ayare 
informed by our notions about star formation (the IMF) the overall structure of 
the Galaxy and various constraints on the dust distribution. Overall the goal of 
most survey analysis is to determine the probability of the stellar observables for 
a given set of desiderata, which requires both isochrones and stellar atmospheric 
models (see Burnett & Binney 2010). In practice, most existing Galactic surveys 
analyses can mapped onto this scheme, with logical dependencies often replaced 
by assumed logical conditions (e.g. 'using dereddened fluxes', 'presuming the 
star is on the main sequence', etc.). This graphical model still makes number of 
simplifications and the velocities. 
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Fig. 6 'Physical' color-magnitude diagram, shown by BaSTI isochrones for two 
metallicities, which are spaced 0.2 dex in age between 10^ and 10^° yr. This 
diagram serves as a reference for the discussion of how to derive physical param- 
eters (distances, ages, abundances) for stars from obscrvablcs (cf. Section 3.2): 
the age-independence of the luminosity on the MS makes for robust photometric 
distance estimates; the age and abundance sensitivity of L and Toff near the 
turn-off and on the (sub-)giant branch makes for good age determinations, but 
only if parallax-based distances exist; etc. 

distance-independent observables greatly constrain luminosity and hence dis- 
tance for basically all stars. Foremost, these distance-independent observables 
are Toff or colors as well [X/H], potentially augmented by spectroscopic log g 
estimates. 

A general framework for estimating distances in the absence of parallaxes 
is given in Burnett et al. (2011) or Breddels et al. (2010), and captured by the 
graphical model in Fig. 5. Basically, the goal is to determine the distance modulus 
likelihood 



where the m; are the apparent magnitudes of the star in various passbands (i.e., 
also the 'colors') and Af* is its stellar mass, which predicts the star's luminosity 
and colors (at a given [Fe/H] and tagc, see Fig. 5). Note that [Fe/H] appears 
both as an observational constraint and a model parameter, which determines 
isochrone locations. The prior on ([Fe/H],tagc) may depend on the distance mod- 
ulus to use the different spatial distribution of stars of different metallicities (in 




p{{m„ [Fe/H]}|2?7W, A/„ [Fe/H],„od, tagc)pprior(M,) 

i^prior 



(1) 
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this case, the position of the sky would also be included to compute the 3D 
position) . 

To get distance estimates, one needs to marginalize over the nuisance param- 
eters (Af*, [Fe/H], and tagc in the case above), and one needs to spell out the 
external information (i.e. priors) on the relative probability of these nuisance 
parameters (e.g.. Bailer- Jones 2011). As Fig. 6 makes obvious, the precision of 
the resulting distance estimates depends on the evolutionary phase, the quality 
of observational information, and any prior information. To give a few examples: 
for stars on the lower main sequence, the luminosity is only a function of T^e 
or color, irrespective of age; hence it can be well determined (~ 5 — 10%) if the 
metallicity is well constrained ((5[Fe/H] < 0.2) and if the probability of the object 
being a (sub-)giant is low, either on statistical grounds or through an estimate 
of log g. Juric et al. (2008) showed that such precisions can be reached even 
if the metallicity constraints only come from photometry. On the giant branch, 
colors reflect metallicity (and also age) as much as they reflect luminosity; there- 
fore photometric distances are less precise. Nonetheless, optical colors good to 
~ 0.02 mag and metallicities with (5[Fe/H] < 0.2 can determine giant star dis- 
tances good to ~15%, if there is a prior expectation that the stars are old (Xue 
et al. 2012, in preparation). The power of spectroscopic data comes in determin- 
ing log g, Toff especially in the case of significant reddening, and in constrain- 
ing the abundances, especially [Fe/H]. While only high-dispersion and high-S/N 
spectra will yield tight log g estimates, even moderate-resolution and moderate- 
S/N spectra (as for the SEGUE and LAMOST surveys; see Section 3.3.2) suffice 
to separate giants from dwarfs (e.g., Yanny et al. 2009), thereby discriminat- 
ing the multiple branches of i(color, [Fe/H]) in the isochrones. While main- 
sequence turn-off stars have been widely used to make 3D maps of the Milky 
Way (mostly its halo; e.g., Belokurov et al. 2006; Juric et al. 2008; Bell et al. 
2008), they are among the stellar types abundantly found in typical surveys 
the ones for which precise (< 20%) spectrophotometric distances are hardest to 
obtain (e.g., Schonrich et al. 2011). 

The absolute scale for such spectro-photometric distance estimates is in most 
cases tied to open or globular clusters, which are presumed to be of known 
distance, age and metallicity (e.g., Pont et al. 1998; An et al. 2009). Because 
the number of open and globular clusters is relatively small and, in particular, 
old open clusters are rare, the sampling in known age and metallicity is sparse 
and non-uniform. Therefore, care is necessary to make this work for Disk stars 
of any age or metallicity. Interestingly, the panoptic sky surveys with kinematics 
offer the prospect of self-calibration of the distance scale (Schonrich et al. 2012) 
through requiring dynamical consistency between radial velocities and proper 
motions. 

3.2.2. Chemical abundances 

Chemical abundances are enormously important astrophysically, as they are wit- 
nesses of the Disk's enrichment history and as they are lifelong tags identifying 
various stellar sub-populations. Broadly speaking, metals are produced in stars 
as by-products of nuclear burning and dispersed into the interstellar medium 
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through supernova explosions and winds. This leads to a trend toward higher 
nictallicity as time goes on, with insidc-out formation leading to faster chemical 
evolution or metal-enrichment in the inner part of the Disk. While all supcrnovae 
produce iron, a-element enrichment occurs primarily through type II supernovae 
of massive stars with short lifetimes. Therefore, until type la supernovae, with 
typical delay times of 2 Gyr (e.g., Matteucci & Recchi 2001; Dahlen et al. 2008; 
Maoz et al. 2011), start occurring, the early disk's ISM, and stars formed out of 
it, has [a/Fe] that is higher than it is today. For a smooth star formation history, 
[a/Fe] decreases monotonically with time. However, a burst of star formation, 
e.g., following a gas-rich merger, could re- instate a higher [a/Fe], such that the 
relation between [a/Fe] and age is dependent on the star- formation history (e.g., 
Gilmore & Wyse 1991). 

Beyond the astrophysical relevance of abundance information, it is also of 
great practical importance in obtaining distances (Section 3.2.1) and ages (Sec- 
tion 3.2.3). Among the 'abundances' [X/H], the 'metallicity' [Fe/H] is by far 
of the greatest importance for distance estimates. Practical determinations of 
[X/H] faU into three broad categories: estimates (usually just [Fe/H], or [M/H] 
as a representative of the overall metal content) from either broad- or narrow- 
band photometry; estimates based on intermediate-resolution spectra (R=2000 
to 3000); or estimates from high-resolution spectra (R=10,000 and higher) that 
enable individual element abundance determinations. We discuss here briefly 
abundances obtained in these three different regimes. 

— Stellar Abundances from Photometry: That abundances can be constrained 
through photometry — at least for some stellar types — has been established 
for half a century (Wallerstein 1962; Stromgren 1966). The level of overall 
metal-line blanketing or the strength of particular absorption features varies 
with [Fe/H] (e.g., Bohm-Vitense 1989), which appreciably changes the broad- 
band fluxes and hence colors. How well this can be untangled from changes 
in Teff or other stellar parameters, depends of course on the mass and evo- 
lutionary phase of the star. The most widely used applications are the 'color 
of the red giant branch', which is a particularly useful metallicity estimate if 
there are priors on age and distance (e.g.McGonnachie et al. (2010) for the 
case of M31). In the Galaxy, variants of the 'UV excess method' (Wallerstein 
1962) have been very successfully used to estimate [Fe/H] for vast numbers 
of stars. In particular, Ivezic et al. (2008) calibrated the positions of F & 
G main sequence stars in the SDSS {u — g) — (g — r) color plane against 
spectroscopic metallicity estimates. Based on SDSS photometry, they showed 
that metallicity precisions of (5[Fe/H] ~0.1 to 0.3 can be reached, depending 
on the temperature and metallicity of the stars: in this way they determined 
[Fe/H] for over 2,000,000 stars. 

Some narrow- or medium-band photometric systems (foremost Stromgren 
filters; Stromgren 1966; Bessell et al. 2011) place filters on top of string ab- 
sorptions features, to constrain stellar parameters in particular metallicities 
can do even better in estimating [Fe/H]. Recently, this has been well illus- 
trated by Arnadottir et al. (2010) and Gasagrande et al. (2011), who com- 
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pared Stromgren photometry to abundances from high-resolution spectra, 
showing that [Fe/H] precisions of < 0.1 dex are possible in some Toff regimes. 
Casagrande at al. (2011) also showed that precise Stromgren photometry can 
constrain [a/Fe] to ~ 0.1 dex. In addition, narrow band photometry can pro- 
vide good metallicity estimates ((5[Fe/H] ~ 0.2) for a wider range of stellar 
parameters than SDSS photometry. 

Indeed, the literature is scattered with disputes that a certain photometric 
abundance determination precision is untenable, because the same approach 
yields poor [Fe/H] estimates for a sample at hand. In many cases, this seeming 
controversy can be traced back to the fact that different approaches work vary 
radically in their accuracy across the Toff, log g, tagc parameter plane. 
The abundance information from Gaia will be based on a spectral resolution 
of only R=15 to 80 for most stars; and while these data are produced through 
a dispersive element, they are probably best thought of as many- narrow-band 
photometry; for FGKM stars with g < 19 Gaia data are expected to yield 
S[Fe/R] < 0.2 (Liu et al. 2012). 

— Stellar Abundances from High-Resolution Spectroscopy: High-resolution 
spectroscopy (R> 10000), either in the optical or in the near-IR (Steinmetz et al. 
2006; AUende Prieto et al. 2008), is an indispensable tool to obtain in- 
dividual element abundances and to anchor any abundance scale in 
physically- motivated models of stellar photospheres (e.g., Asplund et al. 
2009; Edvardsson et al. 1993; Reddy et al. 2003; Bensby et al. 2005). The 
abundances are then fit by either modeling the spectrum directly in pixel 
or fiux space, or more commonly by parsing the observed and the model 
spectrum into a set of line equivalent widths which are compared in a 
sense. From typical existing spectra, about 10 to 20 individual abundances 
are derived, with relative precisions of 0.05 to 0.2 (e.g., Reddy et al. 2003; 
Boeche et al. 2011). High resolution spectroscopy has remained the gold 
standard for investigating chemo-dynamical patterns in the Milky Way in 
detail. 

— Stellar Abundances from Moderate-Resolution Spectroscopy: In recent years, 
large data sets at moderate spectral resolution^ have become available, fore- 
most by SDSS/SEGUE (Yanny et al. 2009) in the last years, but in the fu- 
ture also from LAMOST data (Deng et al. 2012). Such data can provide 
robust metallicities [Fe/H], good to (5 [Fe/H] < 0.2, and constrain [a/Fe] 
t0(5[a/Fe] < 0.15 (Lee et al. 2008a; Lee et al. 2011), for F, G and K stars. The 
accuracy of these data been verified against high-resolution spectroscopy and 
through survey spectra in globular clusters of known metallicity (Lee et al. 
2008b; Lee et al. 2011). Recently, Bovy et al. (2012c) have pursued an al- 
ternate approach to determine the precision (not accuracy) if abundance 
determinations [Fe/H], [a/Fe], through analyzing the abundance-dependent 
kinematics of Disk stars. They found that for plausible kinematical assump- 
tions, the SDSS spectra of G-dwarfs must be able to rank stars in [Fe/H] and 
[a/Fe] at the (0.15, 0.08) dex level, respectively. These precisions should be 

^ The RAVE survey, at R = 7500, is considered moderate resolution by some self- 
respecting stellar spectroscopists 
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compared to the range of [Fe/H] (about 1.5 dcx) and [a/Fe] (about 0.45 dex) 
of abundances found in the Disk, which illustrates that moderate resolution 
spectroscopy is very useful for isolating abundance-selected subsamples of 
stars in the Disk. 

3.2.3. Stellar ages 

Constraints on stellar ages are of course tremendously precious information for 
understanding the formation of the Disk, yet are very hard to obtain in practice. 
The review by Soderblom (2010) provides an excellent exposition of these issues, 
and we only summarize a few salient points here. For the large samples under 
discussion here, the absolute age calibration is not the highest priority, but the 
aim is to provide age constraints, even if only relative-age constraints, for as 
many stars as possible. In the terminology of Soderblom (2010), it is the 'model- 
based' or 'empirical' age determinations that are relevant here: three categories 
of them matter most for the current context (0.5 Gyr < tagc ^ 13Gyr), depend- 
ing on the type of stars and the information available. First, the chromospheric 
activity or rotation decay with increasing age of stars, leading to empirical rela- 
tions that have been calibrated against star clusters (e.g., Baliunas et al. 1995), 
especially for stars younger than a few Gyr. Soderblom et al. (1991) showed that 
the expected age precision for FGK stars is about 0.2 dex. Second, stellar seis- 
mology, probing the age-dependent internal structure can constrain ages well; 
the advent of superb light-curves from the Kepler mission, has just now enabled 
age constraints — in particular for giant stars — across sizable swaths of the Disk 
(van Grootel et al. 2010). 

The third, and most widely applicable approach in the Disk context, is the 
comparison between isochrones and a star's position in the observational or 
physical Hertzsprung-Russell diagram, i.e., the L (or log g) - Toff, [Fe/H] plane 
(see Fig. 6). Ideally, the set of observational constraints for a star, {datajobs, 
would be precise determinations of {datajobs = {-^7 Tcs, [Fe/H]}, or perhaps 
precise estimates of log 5, Toff, [Fe/H]. But until good parallaxes to D > 1 kpc 
exist, L is poorly constrained (without referring to Tog and [Fe/H] etc.) and 
the error bars on log g are considerable (~ 0.5 dex), leaving Toff, [Fe/H] as the 
well-determined observables. 

All these observable properties of a star depend essentially only on a few 
physical model parameters for the star M,, [Fe/H], iago (Fig. 5), and the observ- 
ables (at a given age) can be predicted through isochrones (e.g., Girardi et al. 
2002; Pietrinferni et al. 2004). For any set of observational constraints, the prob- 
ability of a star's age is then given by a similar expression as that for photometric 
distances in equation (1) (cf. Takeda et al. 2007; Burnett & Binney 2010): 



where p({data}obs|-^^*, [Fe/H], tago) is the probability of the data given the model 
parameters (i.e., the likelihood); the shape of that distribution is where the un- 
certainties on the observables are incorporated. Further, not all combinations of 




p({data}obs|M*,[Fe/H],tago) 

Pp (M* I tago )Pp ( [Fe/H] I tagc ) dM* d[Fe/H] , 



(2) 
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Af», [Fc/H],tagc arc equally likely, sinee we have prior information on pp( A/, |tagc) 
and pp([Fe/H]|tagc)- Those prior expectations come from our overall picture of 
Galaxy formation, from our knowledge of the stellar mass function, or they may 
reflect the sample properties, when analyzing any one individual star. To be spe- 
cific, Pp(M*|tagc) simply reflects the mass function, truncated at Af*.,„az:(^age), if 
one accepts that there has been an approximately universal initial mass function 
in the Disk. 

The integration, or marginalization, over Af* and [Fe/H], of course involves 
the isochrone- based prediction of observables {paramsjic, as illustrated in Fig. 5, 
so that 

p({data}obs I A/, , [Fe/H] , iago) ^ P ^{datajobs | {paramsjic (A//* , [Fe/H] , tagc) ^ , 

(3) 

where {paramsjic could for example be {L,Teff, [Fe/H]}ic(AfH., [Fe/H],iagc)- 

To get the relative probabflities of different presumed ages, £({data}obs|iagc), 
one simply goes over all combinations of Af* and [Fe/H] at tago and one integrates 
up how probable the observations are for each combination of A/* and [Fe/H]. 
This has been put into practice for various large samples, where luminosity 
constraints either come from parallaxes or from log g (see Nordstrom et al. 2004; 
Takeda et al. 2007; Burnett & Binney 2010). 

The quality of the age constraints depends dramatically both on the stellar 
evolutionary phase and on the quality of the observational constraints. It is worth 
looking at a few important regimes, using the isochrones for two metallicities in 
Fig. 6 as a guide. Note that stars with ages > 1 Gyr, which make up over 90% 
of Disk, correspond to the last 4 isochrones in this Figure. 

1) For stars on the lower main sequence (where main sequence lifetimes ex- 
ceed 10 Gyr; L < Lq), isochrone fitting provides basically no age constraints at 
all; in turn, photometric distance estimates are most robust there. For stars on 
the upper main sequence, equation (2) 'automatically' provides a simple upper 
limit on the age, given by the main sequence lifetime. 

2) For stars near the main-sequence turn-off and on the horizontal branch, 
the isochrones for a given metallicity are widely spread enabling good age deter- 
minations: Takeda et al. (2007) obtained a relative age precision for their stars 
with log g < 4.2 of ~ 15%. Similarly, the parallax-based luminosities of the stars 
in the Geneva-Copenhagen Survey (GCS) provide age estimate for stars off the 
main sequence of ~20%. Note that the [Fe/H] marginalization in equation (2) 
and the strong metallicity dependence of the isochrones in Fig. 6 illustrate how 
critical it is to have good metallicity estimates: without very good metallicities, 
even perfect Gaia parallaxes (and hence perfect luminosities) will not yield good 
age estimates across much of the color-magnitude diagram. 3) On the red giant 
branch, there is some L — Toff spread in the isochrones of a given metallicity; but 
for ages > 1 Gyr the metallicity dependence of Toff is so strong as to preclude 
precise age estimates. 

Note that the formalism of equation (2) provides at least some age constraints 
even in the absence of good independent L constraints (e.g., Burnett & Binney 



The Milky Way's Stellar Disk 



23 



2010), as the different stellar phases vary vastly in duration, which enters through 
the strong dependence in equation (2). 

3.2.4. Interstellar extinction 

The complex dust distribution in the Galaxy is of course very interesting in 
itself (e.g., Jackson et al. 2008), delineating spiral arms, star formation locations, 
and constraining the Galactic matter cycle. For the study of the overall stellar 
distribution of the Milky Way, it is foremost a nuisance and in some regimes, 
e.g., at very low latitudes, a near- fatal obstacle to seeing the entire Galaxy in 
stars (e.g., Nidever et al. 2012). Unlike in the analysis of galaxy surveys, where 
dust extinction is always in the foreground and can be corrected by some integral 
measure Ax{£,b) (from, e.g., Schlegel et al. 1998), one needs to understand the 
3D dust distribution in the Galaxy, A\{£,b, D). On the one hand, one needs 
to know, and marginalize out, the extinction to each star in a given sample, 
foremost to get its intrinsic properties. On the other hand, for many modeling 
applications one needs the full 3D extinction information Ax{£,b, D), also in 
directions where there are no stars, e.g., in order to determine the 'effective 
survey volume' (see below): it obviously makes a difference whether there are 
no stars in the sample with a given {£, h, D) because they are truly absent or 
because they have been extinguished below some sample flux threshold. 

Practical ways to constrain the extinction to any given star, given multi- 
band photometry or spectra, have recently been worked out by, e.g.. Bailer- Jones 
(2011) and Majewski et al. (2011) for different regimes of A^. In the absence of 
parallax distances or spectra for the stars, estimates oi Ax{£, b) inevitably involve 
mapping the stars' SED back to a plausible unreddened SED, the so-called stellar 
locus. This de-reddening constrains the amount of extinction, but only through 
the wavelength variation of the extinction, the reddening. 

Estimates of Tcff or L from spectra or from well-known parallax distances, 
allow of course far tighter constraints, because the de-reddened colors are con- 
strained a priori and because also the 'extinction' not just the 'reddening' ap- 
pears as constraints. 

How to take that star-by-star extinction information, potentially combine it 
with maps of dust emissivity (e.g., Planck Collaboration et al. 2011), to get a 
continuous estimate of A\{£, b, D) has not yet been established. This task is not 
straightforward, as emission line maps indicate that the fractal nature of dust 
(and hence) extinction distribution continues to very small scales. Presumably, 
some form of interpolation between the A\{£,b^D) to a set of stars, exploiting 
the properties of Gaussian processes, is a sensible way forward. 

3.2.5. Towards an optimal Disk survey analysis 

The task ahead is now to lay the foundation for an optimal survey analysis 
that will allow optimal dynamical modeling, by rigorously constructing the PDF 
for the physical quantities of interest, x,v,tagc, [X/H], Af, from the direct ob- 
servables. The preceding subsections show that many sub-aspects of the overall 
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Tabic 1. Stellar Photometric Surveys of the Milky Way 



Survey 




Period 


Sky Area 


# of Filters 


mag lim. 


i5[Fe/H] 


2MASS 




1998 - 


2002 


all sky 


5 


H=15 


N/A 


Skrutskie et al. 2006 








40,000 deg^ 


1.2/i - 2.2fi 






SDSS I-III 




2002 - 


2012 


North, 1>20° 


5 


g=22 


0.2 


Eisenstein et al. 2011 








15,000 deg^ 


0.4/j - 0.9ti 






PanSTARRSl 




2011 - 


2013 


S > -20 


5 


g=22 


0.4 


Kaiser et al. 2002 








30,000 deg^ 


0.5/i - l-O^i 






VHS 




2010 - 


2015 


South 


5 


J=20 


N/A 


McMahon et al., 2012, 


in prep. 






20,000 deg^ 


1.2/i - 2.2fi 






SkyMappcr 




2012 - 


2014 


South 


5 


g=21 


0.1 


Keller et al. 2007 








15,000 deg^ 









approach (Fig. 5) have been carried out and published; and a general frame- 
work has been spelled out by Burnett & Binney (2010). What is still missing is 
a comprehensive implementation that uses the maximal amount of information 
and accounts for all covariances, especially one that puts data sets from different 
surveys on the same footing. 

3.3. Existing and current Disk surveys 

There is a number of just-completed, ongoing, and imminent surveys of the stel- 
lar content of our Galaxy, all of which have different strengths and limitations. 
Tables 1 and 2 provide a brief overview of the most pertinent survey efforts. The 
(ground-based) surveys fall into two categories: wide-area multi-color imaging 
surveys and multi-object, fiber- fed spectroscopy. Broadly speaking, the ground- 
based imaging surveys provide the angular distribution of stars with complete 
(magnitude- limited) sampling, proper motions at the ~3nias yr~^ level (in con- 
junction with earlier imaging epochs), and photometric distances; for stars of 
certain temperature ranges (F through K dwarfs) they can also provide metal- 
licity estimates. 

The ground-based spectroscopic surveys have all been in 'follow-up' mode, 
i.e., they select their spectroscopic targets from one of the pre-existing photo- 
metric surveys, using a set of specific targeting algorithms. In most cases, the 
photometric samples are far larger than the number of spectra can be taken, so 
target selection is a severe downsampling, in sky area, in brightness or in color 
range. The survey spectra provide foremost radial velocities, along with good 
stellar photospheric parameters, including more detailed and robust elemental 
abundances. 

In this Section we restrict ourselves to surveys that have started taking 
science-quality data as of Summer 2012, with the exception of Gaia. 

3.3.1. Individual photometric surveys 

— 2MASS (Skrutskie et al. 2006): Designed as an 'all purpose' near- infrared 
(JHK) imaging survey, it has produced the perhaps most striking and clear- 
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Tabic 2. Stellar Spectroscopy Surveys of the Milky Way 



Survey 


Period 


Sky Area 


# of Spectra 


app. mags 


Sv [km/s] 


(5[Fe/H] 


char, distance 


GCS 


1981-2000 


South 


16,000 


V~10? 


0.5 


Indiv 


O.OOSkpc 


SEGUE I+II 


2004-2009 


North, 1>20° 


360,000 


g=15-20 


8 


0.2 


2 kpc 


RAVE 


200372012 


South 


370,0004- 


1=9-12 


3 


0.2 


0.5 kpc 


APOGEE 


2011-2014 


North, 1<20° 


100,000 


H<13.8 


0.5 


indiv. 


10 kpc 


Gaia-ESO 


2012-2015 


South 


150,000 


V<18 


0.5 


indiv. 


4 kpc 


LAMOST 


2012-2018 


North 


3,000,000 


V<18 


10 


0.2 


4 kpc 


Gaia 


2013-2018 


all sky 


50,000,000 


V<16 


10 


0.25 


4 kpc 



est view of the Milky Way's stellar distribution to date with half a Billion 
stars, owing to its ability to penetrated dust extinction better than optical 
surveys. Its imaging depth is sufficient to see, albeit not necessarily recognize, 
modestly extinct giant stars to distances > 10 kpc. It has been very success- 
fully used to map features in the Milky Way's outskirts, e.g., the Sagittarius 
stream (Majewski et al. 2003) and the Monoceros feature in the outer Disk 
(Rocha-Pinto et al. 2003) . It has been the photometric basis for several spec- 
troscopic surveys, in the Disk-context most notably APOGEE (see below). 
However, as a stand-alone survey, it has done not very much to advance our 
knowledge of the Disk, owing to the limitations of deriving robust distances 
and abundances from its data. But 10 years after its completion, it is still 
the only all-sky survey in the optical/near-IR region of the electromagnetic 
spectrum. In imaging at > 10,000 square degree coverage, 2MASS is now 
being surpassed (by 4 magnitudes) by the Vista Hemisphere Survey (VHS) 
(McMahon ct al., 2012, in preparation). 
- SDSS (e.g., York et al. 2000; Stoughton et al. 2002): The primary science 
goals of the (imaging & spectroscopy) Sloan Digital Sky Survey were fo- 
cused on galaxy evolution, large-scale structure and quasars, and the 5-band 
imaging survey 'avoided' much of the Milky Way by largely restricting it- 
self to |6| > 30°. Nonetheless, SDSS imaging has had tremendous impact on 
mapping the Galaxy. Like 2MASS, its impact has been most dramatic for 
understanding the outskirts of the Milky Way, where its imaging depth (giant 
stars to 100 kpc, old main-sequence turn-off stars to 25 kpc), its precise colors 
and its ability to get photometric metallicity constraints (Ivezic et al. 2008) 
have been most effective. On this basis, SDSS has drawn up a state-of the 
art picture of the overall stellar distribution (Juric et al. 2008), drawn the 
clearest picture of stellar streams in the Milky Way halo (Belokurov et al. 
2006), and expanded the known realm of low-luminosity galaxies by two or- 
ders of magnitude (e.g., Willman et al. 2005). SDSS photometry and proper 
motions {Sfi ~ 3 mas yr~^ through comparison with USNOB, (Munn et al. 
2004) have allowed a kinematic exploration of the Disk (Fuchs et al. 2009; 
Bond et al. 2010). However, the bright flux limit of SDSS {g ~15) makes an 
exploration of the Solar neighborhood within a few 100 pc actually difficult 
with these data. 
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- PanSTARRSl (Kaiser et al. 2010): PanSTARRS 1 (PSl) is carrying out 
a time-domain imaging survey that covers 3/4 of the sky {6 > —30°) in 
five bands to an imaging depth and photometric precision comparable to 
SDSS (e.g., Schlafly et al. 2012). PSl has imaging in the y band, but not in 
the u band, which limits its ability to determine photometric metallicities. 
However, it is the first digital multi-band survey in the optical to cover much 
of the Disk, including at 6 = both the Galactic Center and the Galactic 
Anticenter. 

- SkyMapper (Keller et al. 2007): The Southern Sky Survey with the 
SkyMapper telescope in Australia is getting underway in 2012, set to cover 
the entire Southern celestial hemisphere within 5 years to a depth approach- 
ing that of SDSS. Through the particular choice of its five filters, SkyMapper 
is a survey designed for stellar astrophysics, constraining metallicities and 
surface gravities through two blue medium band filters (u,w). Together with 
PanSTARRS 1, SkyMapper should provide finally full-sky sky coverage in 
the optical to _g < 21. 

3.3.2. Spectroscopic surveys 

- Geneva-Copenhagen Survey (GCS) (Nordstrom et al. 2004): This has 
been the first homogeneous spectroscopic survey of the Disk that encompasses 
far more than 1,000 stars. For ~ 13, 000 stars in the Galactic neighborhood 
(within a few 100 pc) that have Hipparcos parallaxes, it obtained Stromgren 
photometry and radial velocities through cross-correlation spectrometry, and 
derived Toft, [Fe/H], log g, ages, and binarity information from them. It has 
been the foundation for studying the Galactic region around the Sun for a 
decade. 

- SEGUE (Yanny et al. 2009): Over the course of its first decade, the SDSS 
survey facility has increasingly shifted its emphasis towards more systemat- 
ically targeting stars, resulting eventually in i? w 2000 spectra from 3800A 
to 9200A for ~350, 000 stars. This survey currently provides the best exten- 
sive sample of Disk stars beyond the Solar neighborhood with good distances 
{:^10%) and good abundances ([Fe/H], [a/Fe]). 

- RAVE (Steinmetz et al. 2006): RAVE is a multi-fiber spectroscopic survey, 
carried out at the AAO 1.2m Schmidt telescope, which, as of 2012, has ob- 
tained R« 7000 spectra in the red CaH-triplet region (8, 410A < A < 8, 795A) 
for nearly 500,000 bright stars (9 < / < 13). RAVE covers the entire South- 
ern celestial hemisphere except, regions at low \b\ and low \£\. The spectra 
deliver velocities to < 2km s^^, Tcff to ~ 200 K, log g to 0.3 dex, and seven 
individual element abundances to 0.25 dex (Boeche et al. 2011). At present, 
precise distance estimates, even for main-sequence stars, are limited by the 
availability of precise (at the 1 — 2%-level) optical colors. 

- APOGEE (AUende Prieto et al. 2008): The APO Galactic Evolution Exper- 
iment (APOGEE), is the only comprehensive near-IR spectroscopic survey 
of the Galaxy; it started in the Spring of 2011 taking R « 22, 500 spectra in 
the wavelength region 1.51^m < A < 1.70^m for stars preselected to likely 
be giants with H < 13.8 mag. It aims to obtain spectra for eventually 100,000 
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stars that yield velocities to better than 1 km s , individual element abun- 
dances and log g; the APOGEE precisions for [X/H] and log g have yet to be 
verified. The lower extinction in the near-IR Ah — Ay/ 6 enables APOGEE 
to focus on low latitude observations, with the majority of spectra taken with 
\b\ < 10°. 

- Gaia-ESO (Gilmore et al. 2012): The Gaia-ESO program, is a 300-night 
ESO public survey, which commenced in early 2012, and will obtain 100,000 
high-resolution spectra with the GIRAFFE and UVES spectrographs at the 
VLT. It will sample all Galactic components, and in contrast to all the other 
surveys, will obtain an extensive set of analogous spectra for open clusters 
with a wide rage of properties; this will constitute the consummate calibra- 
tion data set for 'field' stars in the Disk. The stars targeted by Gaia-ESO will 
typically be 100-times fainter than those targeted by APOGEE, including a 
majority of stars on the main sequence; however, taking spectra at wO.Sy^tm, 
Gaia-ESO will not penetrate the dusty low-latitude parts of the Disk. 

— LAMOST (Deng et al. 2012): The most extensive ground-based spectro- 
scopic survey of the Galaxy currently underway is being carried out with the 
LAMOST telescope. The Galactic Survey, LEGUE, has just started towards 
obtaining moderate-resolution spectra {R « 2000) for 2.5 Million stars with 
r < 18. In the context of the Disk, LEGUE is expected to focus on the Milky 
Way's outer disk, carrying out the majority of its low-latitude observations 
towards the Galactic anti-center. 

3.3.3. The survey road ahead: Gaia 

Gaia is an astrometric space mission currently scheduled to launch in September 
2013 that will survey the entire sky down to 20th magnitude in a broadband, 
white-light filter, G. A recent overview of the spacecraft design and instruments, 
and the expected astrometric performance is given in de Bruijne (2012); the 
expected performance of Gala's stellar parameters and extinction measurements 
is given in Liu et al. (2012). For Disk studies in particular, Gaia will obtain 
10 percent measurements of parallaxes and proper motions out to about 4kpc 
for F- and G-typc dwarfs, down to (non-extinguished) G ~ y = 15, for which 
the Gaia spcctro-photomctry also provides line-of-sight velocities good to ~ 5 to 
10 km s^^ and log g and [M/H] good to 0.1 to 0.2 dex. Overall, Gaia will observe 
approximately 400 million stars with Grvs < 17 — where Grvs is the integrated 
flux of the Radial Velocity Spectrometer (RVS) — for which line-of-sight velocities 
and stellar parameters can be measured (Robin et al. 2012). 

While high-precision samples from Gaia will provide an enormous improve- 
ment over current data, it is important to realize that most Gaia projections are 
in the non-extinguished, non-crowded limit and Gala's optical passbands will be 
severely hampered by the large extinctions and crowding in the Galactic plane. 
In practice, this will limit most Disk tracers to be within a few kpc from the Sun, 
and Gaia will in particular have a hard time constraining large-scale Disk asym- 
metries that are only apparent when looking beyond the Galactic center (the 'far 
side' of the Galaxy, D > 10 kpc, \£\ < 45°). Gala's lack of detailed abundance 
information beyond [M/H] also means that it probably will need to be accompa- 
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nicd by spectroscopic follow-up to reach its full potential for constraining Disk 
formation and evolution. Some follow-up is being planned (e.g., 4M0ST; de Jong 
2011), but no good studies of the trade-offs between, for example, sample size 
and abundance-precision have been performed to date. 

4. From surveys to modeling: characterizing the survey selection 
functions 

Spectroscopic surveys of the Milky Way are always affected by various selection 
effects, commonly referred to as 'selection biases'^. In their most benign form, 
these are due to a set of objective and repeatable decisions of what to observe 
(necessitated by the survey design). Selection biases typically arise in three forms: 
a) the survey selection procedure, b) the relation between the survey stars and 
the underlying stellar population, and c) the extrapolation from the observed 
(spatial) volume and the 'global' Milky Way volume. Different analyses need not 
be affected by all three of these biases. 

In many existing Galactic survey analyses emphasis has been given — in the 
initial survey design, the targeting choices and the subsequent sample culling — to 
getting as simple a selection function as possible, e.g.Kuijken & Gilmore (1989a); 
Nordstrom et al. (2004); Fuhrmann (2011); Moni Bidin et al. (2012). This then 
lessens, or even obviates the need to deal with the selection function explicitly 
in the subsequent astrophysical analysis. While this is a consistent approach, 
it appears not viable, or at least far from optimal, for the analyses of the vast 
data sets that are emerging from the current 'general purpose' surveys. In the 
context of Galactic stellar surveys, a more general and rigorous way of guarding 
against these biases and correcting for them has been laid our perhaps most 
explicitly and extensively in Bovy et al. (2012d), and in the interest of a coherent 
exposition we focus on this case. 

Therefore, we use the example of the spectroscopic SEGUE G-dwarf sample 
used in Bovy et al. (2012d) — a magnitude-limited, color-selected sample part of 
a targeted survey of ~ 150 lines of sight at high Galactic latitude (see Yanny et al. 
2009). We consider the idealized case that the sample was created using a single 
g — r color cut from a sample of pre-existing photometry and that objects were 
identically and uniformly sampled and successfully observed over a magnitude 
range rmin < r < J'max- We assume that all selection is performed in dereddened 
colors and extinction-corrected magnitudes. At face value, such a sample suffers 
from the three biases mentioned in the previous paragraph: a) the survey se- 
lection function (SSF) is such that only stars in a limited magnitude range are 
observed spectroscopically, and this range corresponds to a different distance 
range for stars of different metallicities; b) a. g — r color cut selects more abun- 
dant, lower-mass stars at lower metallicities, such that different ranges of the 
underlying stellar population are sampled for different metallicities; c) the dif- 
ferent distance range for stars of different metallicities combined with different 

It could be argued that a 'selection effect' only causes a 'bias' in the analysis, when 
not properly accounted for. 



The Milky Way's Stellar Disk 



29 




Fig. 7 Spectroscopic survey selection functions: this Figure illustrates two es- 
sential elements of the selection functions in spectroscopy surveys that must 
be accounted for rigorously in analyses of spectroscopic surveys of stars in the 
Galaxy. The specific case is taken from Bovy et al. (2012d) and discussed in Sec- 
tion 6.3. The left panel shows in grayscale the number density of stars with SDSS 
photometry (presumed to be complete) within the G-dwarf target selection box 
in (r, g — r) space; the contours show the distribution of stars that resulted in 
successful spectroscopic catalog entries (the spectroscopic completeness), after 
'bright' and 'faint' plates were taken (see Bovy et al. 2012d) obviously the dis- 
tributions differ distinctly, as also the marginalized histograms show. The right 
panel shows the fraction of 'available' G-dwarf targets that were assigned fibers; 
clearly that fraction varies dramatically with Galactic coordinates. 

spatial distributions means that different fractions of the total volume occupied 
by a stellar population arc observed. 

We assume that a spectroscopic survey is based on a pre-existing photomet- 
ric catalog, presumed complete to potential spectroscopic targets. The survey 
selection procedure can then ideally be summarized by a) the cuts on the pho- 
tometric catalog to produce the potential spectroscopic targets, b) the sampling 
method, and c) potential quality cuts for defining a successfully observed spec- 
trum (for example, a signal-to-noise ratio cut on the spectrum). We will assume 
that the sampling method is such that targets are selected independently from 
each other. If targets are not selected independently from each other (such as, 
for example, in systematic sampling techniques where each 'N'-th item in an 
ordered list is observed), then correcting for the SSF is more complicated. For 
ease of use of the SSF, spectroscopic targets must be sampled independently 
from each other. 

The top panel of Fig. 7 shows the relation between the underlying, complete 
photometric sample and the spectroscopic sample for the SEGUE G-dwarf sam- 
ple. While the sampling in color is close to unbiased, the sampling in r-band 
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magnitude is strongly biased against faint targets because of the signal-to-noise 
ratio cut (> 15 in this case). 

The selection function can then be expressed as a function S{r) of the relevant 
quantities r from the photometric catalog; this function expresses the relative 
fraction of entries in the spectroscopic catalog with respect to the complete 
photometric catalog, as a function of r. We will not concern ourselves here 
with how this function is derived for the spectroscopic survey in question; an 
example is given in Appendix A of Bovy at al. (2012d) and Fig. 7. We assume 
that the SSF is unbiased in velocity space, that is, that all velocities have equal 
probability of being observed, such that the SSF only affects analyses concerned 
with the spatial densities of objects. If we then want to infer the spatial density 
in a; = (i?, z, (j)) of a set of spectroscopic objects, we need to constrain the joint 
distribution \{9) of r, a;, and whichever other parameters / are necessary to 
relate a; to r (for example, when photometric distances are used, the metallicity 
can be used in addition to purely photometric properties to calculate x); we 
denote all arguments of A as = (r, a;, /). The joint distribution can be written 
as 

X{r,xJ)=p{rJ\x)v,{x)\J\S{r), (4) 

where p{r,f\x) is the distribution of r and / as a function of x, and |J| is a 
Jacobian, transforming from the heliocentric frame to the Galactocentric one. 
As discussed in Bovy et al. (2012d), the correct likelihood to fit is 

where the product is over all spectroscopic data points. This likelihood simply 
states that the observed rate is normalized over the volume in space that could 
have been observed within the survey selection constraints as expressed by the 
SSF. 

For example, in the SEGUE example discussed above, we may further assume 
for simplicity that the distance is obtained simply as d{r, g — r) (that is, ignoring 
the metallicity, such that there are no /) with no uncertainty, and that the 
distribution of colors g — r \s uniform over the observed color range, then \{9) 
can be written as 

A(r, 5 — r, a;) = 5{r — r[x,g — r, I, b]) iy^{x) \J\ S{r, g ~ r,£,b) , (6) 

where I and b are Galactic longitude and latitude, respectively; 6{r — r[x,g — 
r,i,b]) is a Dirac delta function that expresses the photometric distance. The 
likelihood then reduces to 

^ TT Mxj) 

\^ Jddd{g^r)dldbi^4x{r,g-rJ,b))\J\S{r,g~r,£,b)' ^ ' 

where we have assumed that we only want to fit parameters of (such that the 
SSF can be dropped from the numerator). For a survey of a limited number of 
lines of sight such as SEGUE, the integral over I and b can be re-written as a 
sum over the lines of sight. 
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Fig. 8 Distribution of stellar element abundances ([Q;/Fe],[Fe/H]) from 
Bevy at al. (2012b): the top panel shows the number density distribution of 
stars as they occur within the SDSS/SEGUE sample; the bottom panel shows 
the stellar- mass- weighted, |z|-integrated distribution, which is quite dramati- 
cally different, illustrating the importance of incorporating the sample selection 
function and the stellar mass weighting. 



Assuming that the SSF does not depend on the velocity, fitting a joint dis- 
tribution function model for the positions and velocity, for example when fitting 
dynamical models to data, uses a similar expression, with the density i/^ simply 
getting replaced by the distribution function, and the integral in the denomi- 
nator in equation (5) includes an additional integration over velocities v. For 
example, in the context of the simplified SEGUE example, we fit a DF f (a;, v\p) 
with parameters p using the likelihood 

C(p)=T\ n^^,V^\P) 

^■^^ Jdrd{g-r)dldbdvi{x{r,g-r,£,b),v\p)\J\S{r,g-r,e,b)' ^' 

Correcting for selection bias b) requires stellar-population synthesis models, 
to connect the number of stars observed in a given color range to the full un- 
derlying stellar population. An example where this correction is performed by 
calculating the total stellar mass in a stellar population given the number of 
stars in a given color range is given in Appendix A of Bovy et al. (2012b). The 
total stellar mass is calculated as 

-iV(M)/^-/, (9) 

where N is the number of stars and (M) is the average mass in the observed color 
range, and Jm is the ratio of the total mass of a stellar population to the mass in 
the observed range. All of these can be easily calculated from stellar-population 
synthesis models (see Appendix A of Bovy et al. 2012b). 

The correction for bias c) above involves extrapolating the densities of stars 
in the observed volume to a 'global' volume. This is useful when comparing the 
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total number of stars in the Milky Way in different components. For example, 
Bovy et al. (2012b) calculated for different MAPs the total stellar surface density 
at the solar radius as the 'global' quantity — also correcting biases a) and b) 
above — extrapolating from the volume observed by SEGUE (see below). This 
extrapolation requires the spatial density of each component to calculate the 
fraction of stars in the observed volume with respect to the 'global' volume. 
Correcting for this bias is then as simple as multiplying the number of stars in 
the observed volume by this factor. 

5. Dynamical modeling of the Disk 

5.1. Goals 

To recall from the Introduction, the goals of dynamical modeling in the Disk con- 
text are two-fold: learn about the gravitational potential, p(<P(a;, t)|{data}), and 

learn about the chemo-orbital distribution of Disk stars, p( J, </>, [X/H], tagc|^(35, t)) 
The lengthy, explicit notation makes it clear that we need to evaluate the grav- 
itational potential in light of all the data, and that we need to marginalize over 
the different (l>{x,t) to learn about the orbital distribution. That is, a simul- 
taneous solution for potential and orbits is needed (e.g., Binney & McMillan 
2011). This is a challenge that much of the existing dynamical analyses of the 
Disk have bypassed: when focusing on orbital structure, they have assumed a 
'fiducial potential' (e.g., Helmi et al. 1999; Klement et al. 2008; Dierickx et al. 
2010), or by simplifying the treatment of the orbital structure, e.g., by the 
use of the Jeans Equation (e.g.. Just & Jahreifi 2010) or an ad hoc simplified 
distribution function. Also, basically all dynamical models have focused on 
steady state models, <P{x, /;), or 'P{x), an assumption that seems wise to retain 
until comprehensive equilibrium modeling has actually been implemented. 

The gravitational potential near the Disk has conventionally been character- 
ized separately by the 'rotation curve', Vc{R) and by the 'vertical force' Kz = 
^{Rq, z) at the Solar radius, which describes the surface mass density near the 
mid-plane, -£"(< z,Rq). Good basic estimates for both quantities have been in 
place for two decades, I'dR) ~ 225 km s^^ and S{< l.lkpc, i?o) — 75Mqpc— ^, 
with quoted uncertainties at the 10% to 15% level. The challenge therefore is 
to exploit the emerging wealth of data to take this to the next precision level, 
to obtain analogous constraints over a wider portion of the Disk and to even- 
tually make statements about the non-(axi-) symmetric aspects of the potential. 
At the same time, one would like to eliminate uncertainties in ${x) as a factor 
in analyzing the stellar orbit (sub-) structure. 

These aspects alone imply that a methodology has to be in place to constrain 
the potential to 6(!>/'P < 0.1. In addition, one should expect that the ability of 
data sets to discriminate models grows as -\/A^data, where A^data is the number of 
data points with useful (x^v) information. By now, A^data has reached 10^, with 
Gaia, it will be > 10 ''^^: this puts enormous demands on the 'modeling' if the 
information content of the data is to be exploited. As even the Jeans Equation in 
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its simplest (ID) form illustrates, dynamical modeling always involves the link 
between the gravitational potential, the kinematics of tracers and the (gradients 
of the) spatial density (ly) of tracers, 



this illustrates the paramount importance of knowing the selection function of 
any survey, which enters the determination of vtracev 

It this context it then also becomes no longer tenable to view the stellar Disk 
in isolation, as the mass contributions from the dark matter and the ISM must 
be taken into account. 

5.2. Approaches 

To date, the majority of constraints on the Galactic potential from Disk stars 
have been derived through the Jeans Equation (cf. Binney & Tremaine 2008; 
e.g., Holmberg & Flynn 2004; Bovy & Tremaine 2012; Zhang et al. 2012; see 
also Section 5.3). This approach has been sensible, because for a relatively cold 
and thin disk, one has a clear prior on the orbital structure: most orbits should 
be 'nearly-circular', and the deviations from this can be treated as vertical or 
in-plane oscillations around a guiding center. 

However, solving the Jeans Equations neither delivers a distribution function 
for the stars, nor can it provide constraints on (p marginalized over the possible 
distribution functions; it only can provide a ^-constraint conditioned on a par- 
ticular (yet ill-specified) distribution function. Hence, more rigorous modeling 
approaches that explicitly account for the orbit-nature of the stars are needed 
in the Disk context, both given the quality of the data and the subtlety of the 
questions one would like to address. Mostly in the context of modeling external 
galaxies or star clusters, three ultimately related approaches have been developed 
over the last decades. 

— Methods based on distribution functions: Assuming families of analytic dis- 
tribution functions, for which p{x, v) can be predicted in a given potential 
'P{x) and compared to the data {x,v}. By varying (l>{x) and finding the 
distribution function whose prediction matches the data in each case, one 
obtains C(^{x,v}\(l>{x)y, this approach also yields the distribution function, 

DFbost(^^best)- 

— Orbit-based methods: This entails the prediction of p{x, t;|orbit) by explicit 
calculation of these orbits ('Schwarzschild method'), where the best potential 
is found again by varying ^(a;) and optimizing the 'weights' of all calculated 
orbits to match the data. Among the advantages of this approach are that no 
explicit distribution function needs to be spelled out, which is difficult to do 
in, e.g., non-axisymmetric potentials; among the disadvantages is that it is 
difficult to ensure that the distribution function-space has been well-sampled 
by the discrete orbits. This approach has been implemented by Rix et al. 
(1997), Gebhardt et al. (2003), and van den Bosch et al. (2008). 
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— Particle-based methods: Iteratively modifying a self-consistent iV-body 
model, so that its predictions increasingly better match a set of given 
observations ('made-to-measure'). This approach has the advantage that no 
orbit library needs to be stored and that it makes no explicit symmetry 
assumptions; at least in simple cases, it is shown to recover the correct 
distribution function. However, by its particle nature, its outcome is only 
one specific particle sampling of the underlying distribution function. This 
approach has been originally devised by Syer & Tremaine (1996), and its 
practical applications for galaxy modeling have been pursued foremost by 
Gerhard and collaborators (e.g., Morganti & Gerhard 2012)). 

5.2.1. Data-model comparison in dynamical modeling 

Modeling the Disk also calls for data vs. model comparisons that differ from those 
most commonly used in stellar dynamical modeling. Usually, the data-model 
comparison is presented by comparing the surface-brightness, mean- velocity and 
velocity dispersion profiles with data that have been binned or never been re- 
solved into individual stars. The case of the Disk, or most other parts of the Milky 
Way, is of course different, in at least two respects: first, the data are obtained 
star-by-star, where each star has uncertainty estimates for each of its phase-space 
coordinates; second, our position within the disk means that the direct dynami- 
cal observables {data} = p(uios, D) map into very different components of 
{x,v)qc- Further, the Sun's motion with respect to the Galactic rest-frame sys- 
tem and with respect to any sensible local co-rotating reference frame ('standard 
of rest') is still under considerable debate (e.g., Schonrich et al. 2010; Bovy et al. 
2012a). This enters into the {vios, fi,i,b, D) {x,v)gc transformation. Finally, 
the size of the uncertainties in (z;ios, M, i, b, D) will vary dramatically among sam- 
ple members, e.g., if some sample stars have vios measurements and others do 
not. 

While this calls for an approach beyond comparing tracer density and dis- 
persion profiles, a data-model comparison is still straightforward for any model 
that predicts n^{x, v), or i'^,{x) a.nd p{v\x) , for a given <P{x). Rather than asking 
models to match binned moments of the observables, :^*(i?, z) and (7r/^/z{R, z), 
one simply calculates the likelihood of the individual data £({data},^*-^|model), 
see Bovy et al. (2012d). 

5.3. Recent results 

In light of the vastly better data, the recent progress in understanding the dy- 
namical properties of the Milky Way and the local (few kpc) Galactic potential 
may seem disappointing: the debate around the 'circular velocity at i?o' has not 
abated, nor have the constrains on the stellar and DM mass distribution near Rq 
tightened by significant factors. Before sketching the road towards comprehen- 
sive dynamical modeling (Section 7), it is worth looking at some of the recent 
results to describe the status quo, first for Wcirc(-R) then for Kz{z\Ro). 
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5.3.1. The Disk's 'circular velocity' 

Characterizing the Milky Way's central mass distribution by its circular velocity 
has tradition and is (operationally) sensible. For an axisymmetric galaxy, Wcirc(^) 
is simply Rd(I>/dR{R,z ~ 0). The 'rotation curve', especially Wcirc(— ^o), can 
be constrained in many different ways: by considering the reflex motion of the 
Galactic Center (e.g., Chez et al. 2008; Reid et al. 2004), by measuring the ve- 
locities of globular clusters or halos stars (e.g., Sirko et al. 2004; Deason et al. 
2011), considering them to be a non-rotating component, by measuring line-of- 
sight or 3D velocities to ISM tracers (Fich et al. 1989; Reid et al. 2009), pre- 
sumed to be on near-circular orbits, by determining an overall mass model 
for the Galaxy and its halo and interpolating inferences to {Rq,z ~ 0) (e.g., 
Xue et al. 2008; Koposov et al. 2010; McMillan 2011), or by mapping and model- 
ing the stellar velocities across a sufBciently large portion of the Disk (Bovy et al. 
2012a; Schonrich 2012). Over the last years, this has led to (published) es- 
timates of Wcirc(— Rq), ranging from 215 to 255km s^^, with little seeming 
progress since the lAU recommended a value of 220 km s^^, nearly 30 years 
ago (Kerr & Lynden-Bell 1986). 

This range of values, given the precision that the data seemingly can offer, 
can be traced to the fact a) that the Sun's motion with respect to Wciic(-Ro) is 
uncertain at the ~10km s~^ level, b) that the Galactic potential is not axisym- 
metric presumably at the ~ 5% level, and c) that (hence) cold tracers do not 
move on circular orbits. Judging from external galaxies (e.g., Rix & Zaritsky 
1995), non-axisymmetries in the gravitational potential at i?o in a Milky- Way- 
like galaxy are expected to be at the 5 to 10% level, with causes ranging from 
lopsidedness (m = 1), to spiral arms in the stellar disk mass distribution, to 
bars in the center, and potentially asymmetric dark matter halos. This leads to 
smooth azimuthal variations in of closed orbits, the closest equivalent to Wcirc- 
As of now, the majority of the analyses have focused on the nearby half of the 
Disk, or the local quadrant. This makes analyses susceptible to such asymmetries 
and makes it hard to test for them. The observability of ISM tracers, such as 
masers (Reid et al. 2009), depends potentially strongly on their orbital phase; 
therefore the assumption that the orbital phases are random, underlying most 
analyses, is poorly justified. In principle, stars are better tracers of a 'smooth 
rotation curve', as their finite kinetic temperature reduces their response to non- 
axisymmetric potential perturbations. For them, however, a model accounting 
for their velocity dispersion, and corresponding lag in w^, the asymmetric drift 
must be made (see Bovy et al. 2012a). 

In the context of axisymmetric models, Bovy et al. (2012a) has thorough ana- 
lyzed new stellar radial velocities and approximate distances from the APOGEE 
survey, and has extensively explored and accounted for a wide range of uncer- 
tainties and model assumptions. They find a rotation curve that is very close to 
fiat and has Vcm = 218 ±6 km . They show that this result is consistent with 
all other existing determinations. This result, however, implies that the Sun's 
azimuthal velocity, v^, is 24 km s~^ higher than ?;circ(^o)- The main remaining 
model limitation in Bovy et al. (2012a)'s approach is that the possible conse- 
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quences of non-axisymmetries have not been thoroughly explored, not even the 
known ones that must arise from the Galactic bar. 

5.3.2. The potential perpendicular to the Disk 

The second focus of dynamical Disk modeling has been the study of the gravita- 
tional potential perpendicular to the Disk at ~ i?o- Broadly speaking, Kz{z) = 
\d(l>/dz{Ro, z)\ is expected to vary hnearly for small z (~ 200 pc), as the en- 
closed (presumably) stellar surface mass density U^{< \z\) grows linearly; for \z\ 
above the dominant layer of Disk mass, the if^-profile is expected to flatten, as 
for plane parallel geometry on has Kz{z) — )■ const, for Ut{< \z\) = const. Once, 
or if, a spheroidal dark matter distribution becomes dominant while \z\ ^ Rq 
still approximately holds, one expects again Kz{z) cx z; for a given radial dark 
matter profile, pBuif), the relation Kz{z) cx pdm(^0:-z = 0) x z provides a 
constraint on the local dark matter density. 

Building on the seminal work of Kapteyn (1922) and Oort (1932), who coined 
the term if^-force, the analysis of Kuijken & Gilmore (1989a, b) has set the stan- 
dard for such an analysis for decades. They adopted a parameterized ID vertical 
force law of the form 

K,{z) = 2nG{Eo x ^ / ) + 4^GpDM x z, (10) 

where Sq is the integrated disk surface mass density at the Sun's radius (includ- 
ing the contribution from the cold interstellar medium). The thickness of the 
stellar distribution is described by Zh and the DM density at {Rq, 0) is described 
by pdm- In order to break the degeneracy between the two iiT^-terms that scale 
linearly with z, it is necessary that 2So.*/hz is substantially larger than pdm, 
which turns out to be the case. 

One the one hand, it is therefore necessary to have observational constraints 
with |z| ^ hz] on the other hand the present analysis context is limited by the 
requirement that a one-dimensional analysis in cylindrical coordinates makes 
sense: taken together this leads to a vertical range over which constraints are 
needed of |zmax| —1 to 4kpc. 

Kuijken & Gilmore (1989a) laid out how to link the observables — the vertical 
density distribution of tracer particles, j/* (z) and their vertical velocity dispersion 
profile, CTz{z) — to Kz{z). Their approach and all subsequent ones have exploited 
variants of the Jeans Equation, where — of course — it is crucial that the radial 
dependence of all properties is correctly taken into (e.g., Kuijken & Gilmore 
1989a; Bovy & Tremaine 2012). These radial terms matter least in the case of a 
perfectly flat rotation curve near Rq , which fortunately or fortuitously seems to 
be an excellent approximation (e.g., Bovy et al. 2G12a). 

Based on an approximately volume-complete sample of K stars towards the 
Galactic pole, Kuijken & Gilmore (1989b) and Kuijken & Gilmore (1991) found 
at the time that the best determined quantity was i^<i.ikpc = 71 ± 6Mq pc^^, 
with a likely baryonie disk mass of 48 ± 8 Mq pc~^, and no evidence, or at least 
no data-driven need, for disk dark matter near the Sun. A number of concep- 
tually similar analyses have been carried out since (e.g., Flynn & Fuchs 1994; 




Fig. 9 The (vertical) Galactic potential at the Solar radius. This Figure, taken 
from Zhang et al. (2012) summarizes various estimates for <P{z\Rq): the left 
panel shows various estimates of Kz{z), where the initial steep rise of Kz{z) 
reflects the stellar and gaseous disk mass, and the slope beyond ~ 1 kpc reflects 
the local dark matter density: the red lines show the results from Zhang et al. 
(2012) with the best fitting pdm (solid line) and with puM = 0.008 Mq pc~^ from 
Bovy & Tremaine (2012) (dashed line); the fat portions of the red lines show 
the |z|-range that are directly constrained by the SEGUE data. The dashed line 
shows the result from Kuijken & Gilmore (1991), the gray dash-dotted line for 
z < 1.5 kpc the result from Holmberg & Flynn (2004), and the dash dotted line 
beyond z ~ 1.5 kpc the result from Bovy & Tremaine (2012). The right panel il- 
lustrates recent estimates of the implied local dark matter density: Garbari et al. 
(2012) in blue, Bovy & Tremaine (2012) in black, and Zhang et al. (2012) in red; 
the open histogram shows the joint probability of these estimates. 



Siebert et al. 2003; Holmberg & Flynn 2004): these confirmed the approximate 
values for Z'<ikpc, and Siebert et al. (2003) obtained a first constraint on the 
dynamical thickness of the stellar disk layer. All of these studies were inconclu- 
sive on estimating local dark matter, as the data could neither rule out poM 
nor pdm — 0.06 — 0.12 pc~'^, the value expected for the inward extrapolation 
of global dark matter halo fits (e.g., Xue et al. 2008; Deason et al. 2012). 

This has recently changed, as three studies have claimed significantly non- 
zero pdm estimates from K^-^ype experiments. Garbari et al. (2012) reanalyzed 
literature data, but properly marginalized over a number of assumptions and 
parameters in previous analyses, and inferred pdm = 0.025to^Q^4 Mq pc^'^, po- 
tentially indicating a flattened halo. Bovy & Tremaine (2012) re-analyzed a set 
of ~ 400 stars with distances and kinematics assembled by Moni Bidin et al. 
(2012), and obtained an estimate of 0.008 ± 0.003 Mq pc~'^ (where the uncer- 
tainties also incorporate a systematic component); despite the modestly-sized 
sample this local dark matter Milky Way was enabled by the large vertical ex- 
tent of the tracer stars (~ 4 kpc) , sampling high above the disk. 
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All of these analyses had to carefully model the mctallicity (selection) dis- 
tribution of the stars, as the vertical density profile and effective cr^ disper- 
sion depend sensitively on the abundance mix of the sample stars (see Sec- 
tion 6.2). Zhang et al. (2012) tackled this limitation, by analyzing the Kz{z)- 
force problem considering and fitting abundance-selected, nearly isothermal sub- 
samples separately. Using ~9000 K-type dwarfs from SDSS/SEGUE, they found 
0.0065 ± 0.0025 M0pc"3, which in conjunction with Bovy & Tremaine (2012) 
yields pdm = 0.0075 ± 0.0021 Mq pc^^, or 0.28 ± 0.08 GeVcm-^. 

They also obtained ^<i.i kpc = 68±6 M0 pc~^, consistent with Kuijken & Gilmore 
(1991) and other previous results. It may seem startling that the error bars have 
not become substantially smaller. However, recent determinations have fitted 
directly for many more aspects of the model, rather than simply assuming a 
prior value, which broadens their confidence limits, even with larger samples. 
For example, Zhang et al. (2012) determined not only the tracer scale-heights 
consistently, but also fitted the Disk's effective mass scale height, finding 
100 pc < Zh < 350 pc. 

To date, the dark matter constraints from Kz{z) analyses corroborate other 
evidence, but are not yet better than other approaches. This should to change 
in the near future, as we outline below. 

6. New ways of looking at the Milky Way's stellar Disk 

6.1. Mono-abundance sub-populations 

We now lay out in more detail why we believe that dissecting the Disk in terms 
of 'mono-abundance stellar populations' (MAPs), i.e. in term of stellar subcom- 
ponents with very similar abundances (e.g.[Fe/H] and [a/Fe]), is a productive 
way forward, both for studying galaxy evolution and for dynamical modeling. 
We do so by synthesizing a number of recent results. We also show that for the 
most part this new (or, newly implemented) way of looking at the Disk is largely 
consistent with a range of earlier results when properly compared. 

Work over the last 30 years has shown that the Disk is complex, with stars 
of different ages and abundances showing a different dynamical structure (e.g., 
Gilmore et al. 1989)). To simplify the problem of understanding the Disk, it 
is sensible to dissect it into different components. A dissection into a thin 
and a thick disk, either by spatial or by kinematical criteria seems obvious. 
But the hierarchical assembly and secular evolution processes erase, or at least 
diffuse, dynamical memory with time (Wielen 1977; Sellwood & Binney 2002; 
Kormendy & Kennicutt 2004; Schonrich & Binney 2008; sec Section 7). Further, 
any (sub-)samplc selection based on spatial or kinematic criteria feeds back into 
the inferred structure of that component in very complex ways. This leaves 
'age' and 'chemical abundances' [X/H] as lifelong tags to mark sub-populations 
(Bland- Hawthorn et al. 2010), and disk sub-components have long been defined 
that way (Fuhrmann 2011). Because good age determinations, spanning 1 to 12 
Gyrs, are at present only available for tiny volumes (~ 10^'^kpc'^; where paral- 
laxes exist), this leaves 'mono-abundances' criteria as the sub-population marker 
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Fig. 10 The geometry of 'mono-abundance populations' (MAPs), as derived from 
SDSS/SEGUE data in Bovy et al. (2012d). The Figure shows hues of constant 
stellar number density (red, green and blue) for MAPs of decreasing chemi- 
cal age (i.e. decreasing [a/Fe] and increasing [Fe/H], with color coding analo- 
gous to Fig. 11), illustrating the sequence of MAPs from 'old, thick, centrally 
concentrated' to 'younger, thin, radially extended'. The Figure also puts the 
SDSS/SEGUE survey geometry into perspective of the overaU Galaxy (here 
represented by an image of NGC 891). 

of choice for the near future. Of course, abundances and ages are linked through 
chemical enrichment, making 'mono-abundances' a sensible, albeit qualitative, 
proxy for mono-age populations, an issue which we explore in Section 7. The 
concept of MAPs is different from a mere abundance-based thin-thick disk dis- 
tinction on the basis of abundances: it presumes nothing about the spatial or 
kinematic properties of stars at a given [X/H], nor does it presume that there is 
a small number of distinct components. 

As we will see, the consideration of MAPs allows quite direct inferences about 
galaxy formation, but it has also two advantages for dynamical modeling: first, in 
dynamical modeling it is convenient to have tracer populations that have simple 
orbit distribution function properties, which MAPs turn out to have. Second, all 
MAPs live in the same gravitational potential, and hence provide opportunities 
to cross-check dynamical inferences (cf. Section 5.3.2). 

6.2. Properties of the Disk's mono- abundance sub-populations 

Following Bovy et al. (2012b, c,d) we now lay out the results of analyzing MAPs 
in the Disk, drawing on spectra from SDSS/SEGUE^. Again, qualitatively we 

^ Analogous analyses could and should be carried out with other specspectroscopic- 
troscopic surveys, such as RAVE, APOGEE, Gaia-ESO, etc. 
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are asking "What would the spatial, kinematical, and dynamical structure of the 
Disk look like, if we had eyes only for stars of a particular abundance?" In the 
present context 'abundances' refer to [Fe/H] and [a/Fc] only, both because such 
data are available from SDSS, and because they describe the bulk of the variation 
of individual abundances (e.g., Ting et al. 2012a). While the Bovy et al. (2012d) 
analysis goes well beyond the solar neighborhood (so thoroughly probed by the 
the GCS, but it still covers only 6kpc < R < lOkpc and 0.3 kpc < \z\ < 2kpc 
(see Fig. 10), not the entire Disk. In practical terms, MAPs mean ensembles of 
stars whose abundances are within ~ 0.15dcx in [Fe/H] and ~0.08dex in [a/Fe] 
(Bovy et al. 2012c), leading to about 50 different MAPs in Bovy et al. (2012d). 
Given the total sample size of ~ 20, 000 G-typc dwarfs in SEGUE, typical MAP 
sample sizes in Bovy et al. (2012d) were a few hundred. 

As laid out in Section 2. [Fe/H] and [a/Fe] abundances depend on both the 
degree of chemical enrichment and the speed with which it occurred. In general, 
less [a/Fe] -enhanced and more metal-rich star probably have formed later (e.g., 
Schonrich & Binncy 2009). In particular for metallicity, the formation radius also 
plays an important role, as most galaxies show an outward decay in their mean 
metallicity (see Section 7). For language convenience, however, we will simply 
refer to [a/Fe] -enhanced stars as 'a-old', or 'chemically-old'. 

6.2.1. The spatial structure of mono- abundance populations 

As a main result from the Bovy et al. (2012d) analysis, the spatial structure of 
the Disk's MAPs turned out to be remarkably simple over the observed range: 
fitting a number-density model that is a simple exponential in both the z and 
the R directions matches the data well. If the model complexity is enhanced to 
include two vertical scale-heights (a 'thin' and 'thick' component) the data do 
not point towards two scale heights of significant mass fractions and significantly 
different scale heights for any MAP. Therefore, each MAP can be characterized 
by its scale length i?e, scale height and number-density normalization. 

The complexity of the Disk and the power of MAPs comes in when con- 
sidering the [Fe/H] - [a/Fe]-dcpendence of these structural parameters: Re and 
hz vary systematically with abundances, in a way that depends both on [Fe/H] 
and on [a/Fe], as shown in Fig. 11. Broadly speaking, there is a simple trend: 
chemically-older MAPs form thicker disk components with shorter radial scale- 
lengths. Among the MAPs, the scale-heights range from ~ 200 pc, the clas- 
sical 'thin-disk' regime to > kpc, the classical 'thick-disk' regime. Note that 
components with > 200 pc should still be well sampled by SDSS/SEGUE 
(|z| > 300 pc); while very thin, young components with hz ~ 100 pc could be 
missed. Similarly, the radial scale lengths of these MAPs vary widely: from 
i?e < 2 kpc for the chemically-old MAPs, to an essentially flat radial proflle. 
Re > 5 kpc for the chemically- young MAPs (with solar [a/Fe]) of low [Fe/H]; 
note that for thin components and large Re the SEGUE survey geometry is 
problematic (conflned mostly to I > 30°), as it samples only volumes will above 
the plane at R^ Rq- 

This analysis shows empirically that the Disk contains a continuum of (MAP) 
stellar components that form a sequence from chemically-old, metal-poor, thick. 
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Fig. 11 Spatial structure of the 'mono-abundanee populations' (MAPs) in 
the Disk from Bevy et al. (2012d). Each of the components, characterized by 
([Fe/H], [a/Fe]), can be well-described by a simple exponential density profile in 
both the radial direction and vertical direction. The left two panels show the ra- 
dial and vertical scales as a function of [Fe/H], [a/Fe]; the right two panels show 
Re VS. as a function of [a/Fe] and [Fc/H], respectively. This Figure illustrate 
that from the chemically-old to chemically-younger MAPs the structure changes 
systematical from 'old, thick, radially concentrated' to 'younger, thin, radially 
extended', but in a way that cannot be fully captured by [a/Fc] or [Fc/H] alone 
(cf. Bovy et al. 2012d). 



and radially concentrated to chemically-young, metal-rich, thin, and radially ex- 
tended. We compare these results with cosmological simulations in Section 7; but 
even at face value, these results point quite directly towards inside-out growth 
of the Disk. 

On second look, the distribution of the MAPs' structural parameters has 
some subtleties: e.g, for a given [a/Fe], the more metal-poor MAPs have longer 
scale lengths: there is an outward metallicity gradient in the disk. Finally, it is 
worth noting that Re and hz depend on abundances in a way that cannot be 
captured by any one-dimensional description of the abundances, i.e., by [Fe/H], 
[a/Fe], or any combination of them alone. This may not be surprising, as birth 
epoch and birth radius should be reflected in different ways in the two abundance 
coordinates. 
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Fig. 12 Vertical kinematics of MAPs from Bovy et al. (2012c). The right 
panel shows the vertical dependence of the vertical velocity dispersion of 
MAPs: all MAPs exhibit an isothermal vertical profile at an individual level 
of a few kms~^kpc~^; jointly they are consistent with being isothermal to 
0.3 km kpc~^. The left panel shows that the vertical velocity dispersion in- 
creases when moving from metal-rich, solar [a/Fc] to more metal-poor and/or 
a-enhanced MAPs, similar to the hz behavior in Fig. 11. 

This view of the Disk appears to be on first sight in stark contrast to purely 
geometric thin-thick Disk decompositions (e.g., Juric et al. 2008). However, the 
Bovy et al. (2012d) analysis is the only case to date of a large-scale Disk structure 
analysis where the disk components have been selected solely by a structure- 
independent property: [Fe/H] — [a/Fe]. In all other analyses beyond the Solar 
neighborhood, the disk components have been defined by their geometry, leaving 
inevitably some level of circularity. 

6.2.2. The kinematical structure of mono-abundance populations 

Analogously, the kinematical structure of individual MAPs turns out to be 
simple, in some sense a simple as can be (Bovy et al. 2012c): for each MAP, 
the velocity dispersions cr^ and an arc vertically approximately isothermal: 

o'z,B.{^\R, [X/H]) — const. In particular for ct^, which Bovy et al. (2012c) have 
investigated in detail, the degree of isotliermally is remarkable (see Fig. 12): the 
mean gradient is only \daz/dz\ = 0.2 ± 0.3km s~^ kpc~^. 

As a function of Galactocentric radius, the dispersions show a slow decrease, 
fz,ii(i?|z, [X/H]) (X exp (— (i? — i?o)/7kpc). These kinematics warrant careful 
dynamical modeling, but it seems qualitatively plausible that the redial de- 
pendence simply reficcts the decrease in the restoring force to the disk plane 
(Bovy et al. 2012c). 

Also analogous to the spatial structure, the characteristic velocity disper- 
sion of the MAPs shows a distinct pattern as a function of [Fe/H] and [a/Fe] 
(Fig. 12): the chemically older and thicker components have higher dispersions, 
as anticipated from hz and dynamics (Bovy et al. 2012c). 

As mentioned already in Section 3.2.2, Bovy et al. (2012c) used the isother- 
mality of MAPs to test the SEGUE abundance precision. This is possible, be- 
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cause the trends of thicker and kinematicaUy-hottcr components for more metal- 
poor, [a/FcJ-enhanced MAPs would lead to an increase in vertical velocity dis- 
persion as a function of height when substantial abundance errors lead to abun- 
dance mixing in MAPs. Bovy et al. (2012c) show that the degree of isothermality 
observed for MAPs requires an abundance precision of ~0.15dex in [Fe/H] and 
~0.07dex in [a/Fe], close to the stated SEGUE-pipeline precision. 

While the radial velocity dispersion seems to vary rather analogously to the 
vertical dispersion among the MAPs, there may be qualitative differences in the 
orbit structures of MAPs (especially at the extremes of [Fe/H] — [a/Fc] space), 
as pointed out by Liu & van de Ven (2012). Overall, the kinematics of all MAPs 
are dominated by orbits with relatively high angular momentum (Dierickx et al. 
2010; Wilson et al. 2011; Liu & van de Ven 2012), making a denotation of disk, 
rather than halo sensible. 

In summary, each MAP of the Disk has a very simple spatial and kinematical 
structure, their properties vary widely: the SDSS analysis yields components 
with az from 15 to 50km s~^, from 150 pc to 900 pc, and Re from ~1.5kpc 
to an essentially flat radial profile at Rq. 

6.2.3. The overall structure of the Disk 

Deconstructing the overall structure of the Disk into a large set of MAPs is a 
different approach to slicing the Disk than is commonly done. It behoves one 
then to explore what this implies for the overall structure of the Disk, when 
viewed as a superposition of MAPs. 

This requires some additional considerations in order to put the different 
MAPs on the same footing. At first, such components in any survey such as 
SEGUE are defined at first in terms of the volume corrected number-densities of 
the sample members. However, any operative sample definition, such as SEGUE's 
color cut 0.48 < {g — r) < 0.55, means that those stars stand in for differing 
fractions of the stellar mass of their underlying stellar population, depending on 
their [Fe/H] and tagc(see Section 4) By marginalizing over plausible age distri- 
butions, one can convert the z-integrated surface density of target stars into a 
surface mass density of stellar mass (Bovy et al. 2012b; Schlesinger et al. 2011). 

Along these lines, Bovy et al. (2012b) worked out what the surface-mass 
density distribution of each MAP is at the Solar radius, Sjig{[Fe/ll],[a/Fe]). 
As each MAP has a unique scale height hz, or kinematical temperature (Tz,r, 
associated with it, one can then answer: How much of the total stellar surface 
mass density at Rq comes from stars with a scale-height hz (or with a certain 
az)? If there was a distinct 'thin' and 'thick' disk, one would expect a bi-modal 
distribution in Z'/j^ (hz), with ~85% of it in a thin disk peak (hz — 150 to 250 pc) 
and ~15%in a thick disk (with hz ~700pc) for a canonical disk decomposition 
(e.g., Juric et al. 2008). However, Fig. 13 shows a different picture. There is a 
continuous distribution of IJf(g{hz), with no sign of bi-modality, a distribution 
that is quite well-approximated by Siig{hz) oc exp (^hz/280pcj. The distribution 
of IJfig{az) shows analogous behavior. 

This shows directly that while the Disk has a wide range of scale heights and 
temperatures, thinking of only two distinct thin and thick disk components is not 
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Fig. 13 The Milky Way has no distinct thick disk. This Figure, from Bovy et al. 
(2012b), shows the stellar surface mass density contributions of individual MAPs 
(colored symbols), each of which has associated a unique vertical scale height. 
The black histogram shows the total stellar surface mass density contributions 
from MAPs with scale height hz , which is exponentially decaying towards higher 
/iz, and shows no hint of thin-thick disk bimodality. 



consistent with the data: the Milky Way has no distinct thick disk (Bovy et al. 
2012b). A few things are important to keep in mind: First, this result was able 
to emerge because selecting MAPs only by their abundances, and being able to 
associate a unique scale-height to each star, simply on the basis of its [X/H], has 

enabled a very different look at the Disk. Second, the demonstrated [X/H] preci- 
sion of SDSS/SEGUE argues strongly against the smoothly decreasing Sii„{hz) 
distribution as merely a consequence of poor abundance determinations (see also 
Bovy et al. 2012d). Third, this way of looking at the Disk, as made up from a 
continuum of components of many different scale heights (or temperatures), does 
not change the integrated disk properties (mass- weighted cr^ or h^), compared to 
the traditional thin-thick disk dichotomy; it just shows that there is no structural 
or kinematical dichotomy^. 

To test for consistency of this picture with previous work, we can 're-assemble' 
our decomposition of the disk into MAPs into the overall spatial, kinematical, 
and elemental-abundance structure of the Disk. Gilmore & Reid (1983) dcter- 

^ Perhaps the clearest arguments for a thin-thick disk dichotomy come from bimodal- 
ity in [X/H], which are discussed below. 
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Fig. 14 Overall vertical density profile in the Solar neighborhood implied by the 
MAP decomposition. The full line is the mass-weighted density profile obtained 
by summing the contributions from the various MAPs (Fig.s 11 and 13). The 
points are a noisy sampling of this density and the straight dashed lines are 
exponential fits to the low- and high-height 'data points'. This Figure can be 
directly compared to Figure 6 of Gilmore & Reid (1983). The sum of the two 
exponentials is the curved dashed line, which is barely distinguishable from the 
full line, showing that the Disk's overall vertical density profile implied by MAPs 
can still be represented by a two-exponential fitting function. 

mined the overall vertical structure of the Disk in the solar neighborhood out to 
several kpc and found that it can be represented as the sum of two exponential 
distributions: a 'thin disk' with a scale height of 300 pc and a 'thick disk' with 
a scale height of 1350 pc. Later observations have confirmed this measurement 
(e.g., Juric et al. 2008). We can compare the MAP measurements of the Disk's 
structure to these studies by synthesizing the overall structure of the Disk — i.e., 
not split by elemental abundance — by summing the (mass-weighted) contribu- 
tions from the MAPs. In Fig. 14, we show the overall (mass-weighted) vertical 
density profile in the Solar neighborhood implied by the MAP measurements. 
This Figure shows that the overall density can be well described by the sum of 
two exponential distributions, a 'thin disk' at low heights and a 'thick disk' that 
starts dominating at heights > 1 kpc, even though the distribution is made up of 
dozens of MAPs with the scale height distribution of Fig. 13. Thus, the MAP de- 
composition does not conflict with the measurement of Gilmore & Reid (1983), 
but the decomposition into MAPs based on elemental abundances has allowed a 
qualitatively somewhat different description of the overall Disk structure to be 
found; the description in terms of two exponentials can certainly be views as a 
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Fig. 15 Comparison of the mctallicity gradients implied by the MAP way of 
looking at the Disk with a direct estimate from Schlesinger et al. (2011). 

convenient and well-working fitting function to the mass-weighted structure of 
the Disk. 

Analogously, in the MAP way of looking at the Disk, any vertical or ra- 
dial gradients in abundances or kinematical properties arise foremost from the 
changing contributions of different MAPs at a given {R,z). Of course, given a 
set of MAPs, one can construct spatial gradients in the population mean of, 
e.g., metallicity or velocity dispersion. We illustrate for the concrete case of the 
SEGUE-dcrived MAPs and for z-gradients that this leads to predictions that 
are consistent with direct measurements. As an example. Fig. 15 shows that 
the predicted gradients from the Bovy et al. 2012b, c,d MAPs and the direct 
abundance gradients from Schlesinger et al. (2011). Similarly, integrating over 
all abundances and deriving (Jz{z) leads to a nearly linear rise of the dispersion 
with height above the plane. 

We can also use the MAP model for the Disk to predict the [Fe/H]- [a/Fc] 
distribution at jz] ~ 0. The MAP abundance distribution extrapolated to \z\ < 
50 pc is shown as contours in Fig. 16, where it is compared to the observed distri- 
bution from Adibekyan et al. (2012). The [a/Fe] in this Figure is the combina- 
tion measured by SEGUE— 0.5 [Mg/Fe] -H0.3 [Ti/Fe] -HO.l [Ca/Fe] -hO.l [Si/Fe]— 
and we have subtracted 0.06 from the model's [a/Fe] to put the [a/Fe] on ap- 
proximately the same scale. The MAP model, constrained at [^l > 300 pc, does 
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Fig. 16 Abundance distribution of stars within < 100 pc based on high-dispersion 
spectroscopy (based on data form Adibekyan et al. (2012)). This Figure shows 
that the distribution of [a/Fe] -abundances appears to be bi-modal, albeit not 
as two disjoint distribution as claimed by Fuhrmann (2011); see discussion in 
Section 6.3. 



a remarkably good job of producing the main features of the \z\ < 50 pc abun- 
dance distribution, except for appearing too metal-poor, which is probably due 
to an imperfect description of the more metal-rich, |z| < 300 pc stellar content. 

The dissection and subsequent global synthesis of the Disk through MAPs 
is ultimately an empirical approach that is distinct from synthetic models such 
as the Besancon model (Robin et al. 2003), Trilegal (Girardi et al. 2005), or 
the model of Just & Jahreifi (2010), which we do not discuss in detail here. 
These models build a global Disk model using a mix of stellar populations of 
different spatial, kinematical, and chemical properties through the application of 
stellar-population synthesis models and observational relations and correlations 
between the model variables, such as the age-velocity relation and metallicity 
gradients. The strength of these synthetic Galaxy models lies in their ability to 
summarize our current understanding of the Disk into a consistent — if not always 
dynamically consistent — stellar-populations-based model that can be used to 
simulate the expected content of new surveys (e.g., Robin et al. 2012), test Disk 
analyses on realistic mock data, and provide fiducial or background models for 
the interpretation of new data. 
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6.3. The abundance distribution of the Disk stars 

Beyond a few 100 pc, the abundance distribution of the Disk has only been 
studied very recently with sizable samples (Boeche et al. 2011). However, in the 
immediate Solar neighborhood a number of studies, usually based on the Hip- 
parcos catalog, have revealed quite striking abundance patterns in their sam- 
ples (especially Bensby et al. 2003; Nordstrom et al. 2004; Bensby et al. 2005; 
Fehzing & Bensby 2008; Navarro et al. 2011; Fulirmann 2011). To be explicit, 
we take the chemical abundance distribution to be the mass- weighted probability 
distribution of chemical abundances, pmass([X/H] z) at (i?, z). It is worthwhile 
stating this explicitly, because the published sample abundance distributions of- 
ten differ (for good reasons) drastically from this Pmass distribution, making a 
direct comparison difficult if not impossible. In particular, many studies have ex- 
plicitly included kinematical pre-selections of the target stars (e.g., Bensby et al. 
2003; Feltzing & Bensby 2008; Bensby et al. 2011); others, e.g. Navarro et al. 
(2011), are foremost literature compilations without quantitative accounting for 
the selection function. 

Rightfully prominent among these abundance distribution studies are those 
of Fuhrmann (Fuhrmann 1998, 2004, 2008, 2011), which assembled an approx- 
imately volume-limited sample of about 300 stars within 25 pc of the Sun (i.e., 
a volume of 6 x 10~^kpc"^), with precise individual element abundances from 
high-resolution spectroscopy. Fuhrmann finds a distribution that is strikingly 
bi-modal in the [Fe/H] — [a/Fe] plane. Recent studies, initially geared at exo- 
planet searches, have yielded similar information for stars at typical distances of 
~ 100 pc (Adibekyan et al. 2012): they, too, find a bi-modal distribution in the 
[Fe/H] — [a/Fe] plane, though they do not confirm an actual 'gap' in the abun- 
dance plane, as Fig. 16 shows. So, detailed abundance studies based on high- 
resolution spectra show consistently, that when nearby samples are split by their 
motions into those with kinematics typical 'thin' and of 'thick' disk stars, their 
abundance patterns are distinctly and clearly different (see also Bensby et al. 
2003; Nordstrom et al. 2004; Bensby et al. 2005; Feltzing & Bensby 2008). 

The questions remains to which extent this abundance bimodality in itself 
argues that there is (at least chemically) a distinct thick disk component 
(cf. Fuhrmann's work), attributable to an early (thick disk) star- formation 
epoch in the Galaxy, and a significantly later thin disk formation epoch, 
with a star-formation hiatus in between. Because in contrast to that view, 
(Schonrich & Binney 2008) have argued that ([a/Fe]-)abundance bimodality, 
though no gap, could arise from a perfectly 'smooth' star-formation and enrich- 
ment history. This is because the [Q!/Fe]-enhancement fades rather rapidly, once 
SN la enrichment becomes important. 

At the moment, excellent abundance information over very small volumes tell 
a somewhat different story about an abundance dichotomy in the stellar Disk 
population than the less precise information on 1 kpc scales. The next years, 
providing accurate abundances for large samples over kpc-scales from APOGEE 
and Gaia-ESO should sort this out. 
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In summary, the authors' view — certainly guided and perhaps tainted by 
their own work — points towards a structural description of the Disk in terms of 
an approximately continuous distribution of stellar scale-heights or velocity dis- 
persions, which appears inconsistent with a structural thin-thick Disk dichotomy 
or clear bi-modality. The recent, vastly larger data sets with abundances seem 
to imply that a thin-thick disk description is too simplistic in, rather than being 
'wrong'. Yet, as of this writing, there is no universal acceptance in the commu- 
nity of this view, for understandable reasons: on the one hand, there is a large 
number of previous results that seem very well explained by a thin-thick disk 
dichotomy; while for some of these results, consistency with the continuously 
varying MAP picture has been demonstrated (see above and cf Fig. 14, and 
Fig. 15); other previous results still await their comparison with the continuous 
MAP picture. On the other hand, the clement abundance structure in the Solar 
neighborhood (< 100 pc) does show clear evidence for abundance-bimodality, 
with some data sets pointing towards an actual dichotomy. These have yet to be 
fully confronted or reconciled with the continuous MAP picture. 

An currently open question is whether this MAP view of the galactic disk 
provides useful and new constraints on the extensive body of chemical evolution 
models of the disk that has been developed over the last two decades (e.g., 
Matteucci & Francois 1989; Prantzos & Aubert 1995; Chiappini et al. 2001; 
Fraternali & Binney 2008; Marinacci et al. 2011). An exploration of this issues 
is beyond the scope of this review. 

Taken together, viewing the Disk as a superposition of MAPs appears as a 
promising framework for studies ahead, irrespective of the question of a thin- 
thick dichotomy. In the final Section of this Review, we will sketch two aspects 
where MAPs are useful, which will bring us to the initial issues, the implications 
of Disk studies for dynamics and galaxy formation. 

7. Whither Disk studies? 

In the closing Section of this review, we now return to the two broad issues laid 
out in the introduction: First, what steps still need to be taken to understand the 
3D gravitational potential near the Disk as well as we can, and to draw optimal 
inferences about the dark matter distribution in the Galaxy. Second, how can 
the empirical description of the Disk be linked to galaxy formation mechanisms? 
We start by describing emerging approaches towards comprehensive dynamical 
modeling, followed by illustrative initial comparisons of the Disk structure to 
cosmological simulations in the frame-work of MAPs. We end with our best guess 
of what the conceptual and practical modeling and interpretation challenges are 
in the Gaia era. 

7.1. Towards implementing comprehensive dynamical modeling 

Approaches to stringent dynamical modeling of the Disk, which arc going beyond 
the modeling frameworks described, have also greatly progressed foremost owing 
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to J. Binncy and P. McMillan. Yet, all the elements still have not yet quite 
come together to address: what are the best quantitative constraints on the 
Galactic potential and on the Disk's distribution functiononc can derive from the 
available, or soon available, data? Neither is there a stringent tool for experiment 
forecasting' to address what pieces of observational data are most informative 
on a given dynamical questions. 

These questions require models that are both global, dynamically consistent, 
and can predict the likelihood of diverse data sets. The following section reflects 
not only the authors' thoughts and attempts for a roadmap towards such a mod- 
eling machinery, but extensively draws on the work and plans laid out eloquently 
by Binney and McMillan (Binney 2010,Binney & McMiUan 2011,Binney 2012, 
McMillan & Binney 2012): 

— Given the complexity of the problem, fully dynamically self-consistent models 
should be implemented in the steady-state, axisymmetric regime first, before 
proceeding to non-axisymmetric, time-dependent Disk modeling. That alone 
would provide enormous progress over the status quo. The limitations of such 
modeling can be straightforwardly tested against mock-data from numerical 
disk simulations, which have now reached high-enough resolution(> 300 Mil- 
lion particles, e.g., D'Onghia et al. 2012). 

— Given the vast number of discrete observational constraints, model predic- 
tions need to be continuous in x,v, to directly calculate the likelihood for 
(discrete) data. This argues for a 'distribution function based' approach, 
rather than a discrete particle or orbit based representation of the distribu- 
tion function. 

— To make the joint optimization, or sampling, of <l>{x) and distribution func- 
tion in light of large data sets practical, both ^{x) and distribution function 
should be dcscribable by a modest number of parameters. For the gravita- 
tional potential this leaves two options: either describe the potential by a 
number of discrete, traditional components, such as the bulge, bar, halo,thin 
disk, thick disk and gaseous disk (e.g., McMillan 2011). As the mono- 
abundance analysis of Bovy et al. (2012b, c,d) indicates that a sub-division 
into a few discrete Disk components is not sensible, a description of ^(a;) in 
terms of two parameterized 'components', a spheroidal (not spherical) one 
and a flat, disk-like component, with flexibility in their r, and R, z profiles, 
seems like a more sensible starting point. 

— The family of distribution functions advocated by Binney & McMillan 
(2011), which is cast in terms of actions appears as a conceptually attractive 
and now also practical approach. Specifically, a distribution functioncast in 
terms of the radial, azimuthal and vertical actions J = {Jr, = L^, Jz) of 



the form 




(11) 
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seems like a sensible choice (Binney 2012), where Aj is a set of parameters 
to describe the distribution function. Elements of Aj in this case are fL^), 
Cr{L^), and Dr{L^) (Ting et al. 2012b). OveraU f{L^) sets the radial profile 
of the disk, given <?(a;); Cr{Lz) and Dr^Lz) set the radius-dependent disk 
'temperature' in the radial and vertical direction. Ting et al. (2012b) showed 
that these distribution function families can reproduce the radial and vertical 
properties of mono- abundance Disk populations well, demonstrating that 
suitable simply-parameterized families of distribution functions are available. 
While action-based distribution functions are easy and elegant to write down, 
community acceptance seems to have been hampered by their reputation to 
be hard (or slow) to calculate, beyond the 'azimuthal action', = Lz, which 
is the angular momentum. Recent progress in calculating and testing the 
accuracy of approximate actions (Solway et al. 2012; Binney 2012; Sanders 
2012) should help overcome this issue. 

— 'Modeling' requires efficient computation of {x,v) -s-> {Jr, Lz, Jz), given 
(p{x). Clearly, {x,v\<P{x)) {Jr,Lz,Jz) is easier to calculate, as this does 
not involve explicit treatment of the orbital angles that complement the 
actions; in steady-state modeling, the angles are assumed to be distributed 
uniformly. This argues for modeling approaches that only go computationally 
from configuration to action space. 

— Yet, models need to be evaluated against data in the 'configuration space' 
of observable data, i.e., we have to determine £({data}|Aj, <^(a;|A<|,))) , 

with {data} = {p{x,v,[X/il],ts,ge)}i- This will then inform us about the 
potential after marginalizing over the distribution function parameters, 
£({data}|^(a;|A<f)); or about the distribution function after marginalizing 
over ^(cc), £({data}|Aj) . 

— To calculate meaningful likelihoods of the data for different Xj given <P{x\X4,), 
it is necessary to interpret the distribution function as a probability distri- 
bution. This is where in this context the spatial sample selection functions 
come in (e.g., McMillan & Binney 2012). One approach to incorporating it 
(Ting et al. 2012b)) is to 'normalize' the distribution function over the ob- 
servable volume, f{Xj) Cgcicction ■ /(Aj), where 

Cs~election= / dxdv Psdcctionix) ■ f {j (x , v)\Xj , ^{x\X^)) . (12) 



— Of course, the 'data', {p{x,v, [X/H], tage)}i are not precise points in {x,v), 
but have uncertainties, or may even be missing in some dimensions. In prac- 
tice, this implies yet another mar^ inalization over the data's uncertainties, 

i.e., an integral over 5x,5v,6[X/B],6ti,gc- How to do this computationally 
efficiently and well enough, is yet to be clarified. 

— All of the observables of course depend on the Sun's position and motion, 
where the aspects of the observed velocities that simply are the Sun's reflex 
motion are a notorious source of uncertainty. As recent work has showed, the 
'local standard of rest' is still under extensive debate : as Bovy et al. (2012a) 
showed, the Sun's motion should also be an explicit model parameter. 
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— Any modeling should exploit the measured chemical abundances as 'integrals 
of motion' that separate sub-populations. In essence, the above procedure 
should hold for any MAP, with a distinct distribution function for each, but 
of course the same <P{x) for all. By adding the likelihood contributions from 
all the map's, inferences about 'P{x) should be straightforward. 

— Finally, the dynamics of the Disk do not care about the particulars of the 
various surveys described in Section 3.3, which offer observational constraints 
in different {£, b, D) regimes. Hence models need to be fit simultaneously to 
data from different surveys, which has happened far too little to date. Again, 
this should be straightforward, as any given model (distribution function, 
<?(a;)) can predict the likelihoods for any survey, and data (log-)likelihoods 
simply have to be added. The practical difficulties lie in the tedious task of 
compiling the various sample selection functions. 

As of now, only initial demonstrations of theses approaches exist, retrieving 
information from pseudo-data: McMillan & Binney (2012) showed how well the 
distribution functions of ensembles of 5000 stars could be retrieved, if a pri- 
ori disjoint thin and thick disk components were presumed. Ting et al. (2012b) 
explored how well the parameters of a simplified 3D model for the Galactic po- 
tential could be retrieved, with mock-survey data that resemble SDSS/SEGUE 
G-dwarfs; they could show that the shape of the potential could be constrained, 
but that such constraints are much harder to get than constraints on, say, Wcirc- 

Overall, the above roadmap shows that basically all elements are in place 
to do such comprehensive modeling. This suggest that much more can and will 
be learned about the Galactic potential, drawing on existing data well before 
the first extensive Gaia releases, even if those stay on their 2012 schedule. This 
should provide us with a much firmer picture of (l>{x) for the Disk and with a 
solid base-line distribution function description of the Disk. 

7.1.1. Chemo-dynamical substructure in the Disk 

A distribution function description of the Disk and its MAPs will also provide a 
much more sensible basis to characterize deviations from it as "sub-structure". 
Note that sub-structure can be both 'clumps' in action-space as well as in angle 
space, an aspect that has yet to be explored. 

7.2. Mono- abundance populations in a galaxy formation context 

A second avenue for the near future is to place the emerging empirical results 
about the Disk in the context of galaxy formation. Here we outline what we deem 
to be a promising approach, looking at MAPs in cosmological disk formation 
simulations. As laid out in Section 6, an empirical picture seems to emerge in 
which the Disk is 'seen' to exhibit an age-sequence from 'thick' (large h^) and 
centrally-concentrated (small Re) MAPs to thin and radially-extended ones, if 
[a/Fe] and [Fe/H] can indeed provide an approximate relative age ranking of 
MAPs in the Disk. 
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Fig. 17 Present-day structure of 'mono-age populations' in a cosmological forma- 
tion simulation than led to a Milky- Way-like galaxy from Stinson et al. (2013), 
where each panel shows a present-day, edge-on of stellar subsets in the simula- 
tions, sorted by their age: there is a clear trend of 'old, thick, centrally concen- 
trated' to 'young, thin, extended'. 



In two ways, this picture qualitatively resonates with well established con- 
cepts of galaxy formation. One the one hand, disk galaxy formation should 
proceed from the inside out (e.g.. Mo et al. 1998). On the other hand, there 
arc several reasons why older components of the Disk should be thicker (or 
have higher velocity dispersion): they may have been born with higher disper- 
sion (Bournaud et al. 2009; Forster Schreiber et al. 2011), they may have been 
heated by subsequent tidal interactions or satellite infall (e.g., Quinn et al. 1993); 
outward radial migration appears to conserve approximately the vertical ac- 
tion (not the vertical energy, cf. Schonrich & Binney (2009)), which will slightly 
thicken the disk portions that have migrated outward and slightly decrease their 
vertical velocity (Solway et al. 2012; Minchev et al. 2012a). 

To push a comparison with expectations in the cosmological context further, 
predictions of galaxy formation simulations for MAPs are needed. As described 
in Section 2, disk galaxy formation simulations have made enormous strides in 
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Fig. 18 How well do MAPs reflect mono-age populations in simulations. This 
Figure from Stinson et al. (2013), whose simulations track chemical enrichment, 
shows the age dispersion of stars within each (simulated) MAP bin. For most 
MAPs the age dispersion is less than 1 Gyr, implying that MAPs arc reasonably 
good approximations to mono-age populations. 

producing outcomes that resemble Milky Way like galaxies (Guedes et al. 2011; 
Martig et al. 2012; Stinson et al. 2013), with dominant flat disks and only a 
modest fraction of the stars in a central bulge. Some of these simulations also 
treat the chemical enrichment self-consistently (e.g., Stinson et al. 2013) and 
these simulations provide an initial, at least qualitative, comparison of MAPs 
between cosmological simulations and direct Milky Way observations. Recently, 
Stinson et al. (2013) has carried out such a comparison with the patterns found 
by Bovy et al. (2012d). As illustrated in Figure 17, they found remarkable agree- 
ment. This Figure shows edge-on views of the present-day stellar density distri- 
bution in the simulations, sorted according to the [0/Fc] (as proxy for [a/Fe]). 
As observed in the Disk, there is a sequence of centrally-compact and thick con- 
flgurations at the [0/Fc]-enhanccd end to radially-extended, thin distributions 
at Solar [0/Fe] values. Quantifying this picture by plotting hz vs. Re for MAPs 
in the simulations, confirms that this simulation shows the same behavior as the 
data. This agreement lends encouragement to query the simulations about the 
extent to which mono-abundance populations can serve as proxy for mono-age 
populations. Figure 18 shows the mean age and the age dispersion as a function 
of [Fe/H] and [a/Fe] in the simulations. Remarkably, in most mono-abundance 
bins of these simulations, the age dispersion is < 1 Gyr, justifying the use of 
2D-abundances as age proxies. 
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Of course, at present the above is only an anecdotal comparison with one 
simulation; albeit a state-of-the-art simulation that was 'tuned' to match the 
stellar-mass / halo-mass relation of Moster et al. (2012) at rcdshift zero, and 
that produces a large, fairly massive disk and a small bulge, with the spatial 
and temporal abundance pattern only a consequence of it. Clearly, this kind of 
data model comparison warrant much further exploration, as we lay out in the 
final Subsection. 

Some readers may pine for 'tests' of galaxy evolution mechanism that do not 
depend on comparison with numerical simulations. But such tests may be far and 
few between, as the present-day dynamical structure mostly tells us about the 
present day. This can be illustrated by the quest to find tests for the efficacy of 
radial migration. The Hermes/GALAH experiment (Freeman et al. 2010) has set 
out to provide such a clean test: the ambition is to identify, through 'abundance 
fingerprinting', stars that were formed basically at the same time with the same 
orbital actions and phases. If we can look at their orbits (say angular momenta) 
now, and if we know their ages, then one gets a direct estimate of the importance 
of radial migration, though still no clear hint about the cause of that migration 
(bars, spiral arms, satellites, etc.). 

7.5. Peripheral Disk issues 

The Galactic stellar Disk, as defined here, does of course not live in isolation, 
but it interfaces with other Galactic 'components' at its extremes: with the bar 
and (pseudo-)bulge at small radii; with the slightly warped, complex, and almost 
messy outer reaches of the Disk (> 12kpc); and with the Galactic stellar halo, 
which the data suggest is truly a 'distinct' component from the bulk of the 
Disk (Majewski 1993; Ivezic et al. 2008). We review only briefly the status and 
prospects in these areas. 

— The innermost Disk: Massive late-type galaxies like ours often have bars 
at the center (Barazza et al. 2008) and a thick central portion that is de- 
scribed as a pseudo-bulge (e.g., Kormendy & Kennicutt 2004); the Milky 
Way fits that pattern, as it has a large-scale stellar bar (Binney et al. 1991; 
Blitz & Spergel 1991; Weinberg 1992; Binney et al. 1997). This bar must 
have formed from stars that were part of the pre-existing Disk, and clear sep- 
aration between the components is perhaps moot (e.g., Shen et al. 2010). In 
practice, the present end of the bar is a sensible dividing line, presumably at 
co-rotation 2.5±0.5kpc (Binney et al. 1991). As the patterns speeds of bars 
change secularly (Debattista & Sellwood 2000), it is unclear — and should be 
clarified — whether this radius implies any changes in the abundance or age 
structure. 

— The Outer Fringes of the Disk: The stellar disk of our neighbor galaxy, 
M31, is manifestly frayed in its very outer parts (Ferguson et al. 2002; 
Ibata et al. 2005), possibly because much of the material is tidally disrupted 
satellite debris or possibly because disk material has been grossly perturbed 
by such infall events. There is ample evidence that the outer parts of 
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the Galactic Disk are just as messy: the structure variously known as the 
'anticenter ring', 'Monoceros ring', 'Canis Major feature' (Yanny et al. 2003; 
Martin et al. 2004; de Jong et al. 2010) reflects the fact that there are far 
more stars at i?GC = 15 to 20kpc (Conn et al. 2012) with \z\ > 1 kpc than 
a simple double exponential disk model suggests. This 'feature' of the outer 
Disk has elicited extensive discussion, as two very different (and in the pure 
form probably simplistic) explanations have been advanced to explain it: 
either purely as deposited stellar debris from a satellite that had merged on a 
low-latitude, low-eccentricity prograde orbit (e.g., Pefiarrubia et al. 2005), or 
simply as a combination of a disk warp and flare (e.g., Momany et al. 2006). 
Because the abundance distribution and kinematics of possible 'Monoceros 
ring' member stars are not grossly different from the expectations for an 
outer disk, it will take comprehensive area coverage (e.g., from PSl), good 
distances from Gaia, and extensive spectroscopy (lOOO's of stars) yielding 
velocities and abundances to sort out to which extent these are stars that 
have been 'dragged into' the Disk vs. 'kicked out of the Disk. 



7.4- Some specific tasks for the next years 

Having described the various elements of gathering, analyzing and (dynamically 
and cosmologically) modeling stellar Disk surveys, we now try to cast answering 
some of introductory questions in this review into possible projects that appear 
feasible and necessary for the next years: 

7.4.1. What factors limit dynamical Disk modeling? 

With the modeling machinery of Section 7.1 in place, at least in principle, all 
elements have come together to answer this question. The most straightforward 
approach to doing this is to create mock data sets from iV-body simulations of 
disks that are self-consistent by construction, and feed them though the modeling 
machinery to see how well <l>{x) and distribution function can be recovered. 
Recently, non-cosmological simulations of disk galaxies have reached particle 
sizes of ~ 300 Million particles (D'Onghia et al. 2012), where mock-data sets 
that match the current generation of survey data (RAVE, SDSS, APOGEE, 
etc.) can be drawn directly without supersampling. Then selection functions, 
distance estimates with either random or systematic errors, and (appropriate 
or inappropriate) de-reddening estimates can be applied, where the reddening 
model would be a version of the currently emerging 3D extinction maps for the 
Milky Way (cf. Section 3.2.4). Deriving probability distribution functions on 
(I>{x) in light of a suite of mock data sets, would reveal the factors that limit 
the accuracy of the potential inference. Similar, or even the same, mock-data 
sets can explore to which extent non-axisymmetry of disk galaxies such as the 
Milky Way, in particular spiral arms and a central bar, would affect the (p(x) 
inferences made under axisymmetric assumptions. In turn, with such analyses 
one can explore what it would take to, e.g., measure the strength of the spiral 
arms dynamically. What the mock data, at least in the very high resolution. 
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hence 'non-cosmological' simulations, do not easily supply is an 'abundance tag' 
that allows to define MAPs. Hence, the best way to test how much is gained in 
dynamical analyses by splitting the tracer samples into MAPs still needs to be 
devised. Current experiments along those lines (Ting et al. 2012b) indicate that 
the sheer sample size plays a sub-dominant role in constraining the potential in 
the current regime (> 10^~^ stars). Such mock data sets will also be crucial in 
finding out whether vast numbers of stars with partial phase-space information 
(e.g., without vios) have comparable information content than far smaller sets 
with complete phase-space information. 

7.4.2. How to get to the best chemo-orbital distribution function? 

In the context of comprehensive dynamical modeling (Section 7.1), an estimate 
of the distribution function of course results 'automatically'. Yet, the issues of 
getting an optimal estimate for distribution function are different. On the one 
hand, slight systematic errors in <l>(x) probably lead to a (smooth) distribution 
function that looks very similar to the correct one; so to get a sense of whether 
the vertical action distribution of MAPs varies little with radius (as expected 
for asymptotically-efficient radial migration), an approximate potential probably 
suffices. On the other hand, it is perhaps distribution function sub-structure or 
fine-scale structure that is most interesting. In the limit of a 'cold stellar stream', 
i.e., stars of the same actions that differ only in phase (e.g., Koposov et al. 2010), 
the most plausible potential may be the one that makes the distribution function 
of that stream most like a (5-function, even if that potential is not necessarily the 
most likely in light of the overall set of tracers. Further, sample size, precision 
of the individual {x,v) estimates, and chemical abundances matter far more 
than for merely estimating <P{x). Sample size matters, because diagnostically 
precious sub-populations (e.g., streams) may only make up a very small fraction 
of the Disk mass; (x, t;)-precision matters, because 'cold' substructures, those 
very compact in distribution function space, are of preeminent interest, and 
[X/H] matters, because the distribution function has to be strictly separable in 
chemical abundances, which resemble effective separating integrals of motion. 
Therefore, even with only an initial estimate of distribution function, a full 
exploration of MAP distribution functions, with the largest possible samples 
and the most precise phase-space coordinate estimates will be the next step. 
This is clearly also a direction where dramatic progress using Gaia data will be 
very straightforward. 

7.4.3. The next steps mapping the gravitational potential 

Initial attempts have been made to bring the diverse constraints on the Galac- 
tic gravitational potential on the same footing (Catena & Ullio 2010; McMillan 
2011; Bovy & Tremaine 2012), the outer halo constraints, the rotation curve, and 
the local dark matter estimates from Kz{z). But the step of measuring Kz{z) 
as a function of Rqc i which is feasible even with the existing data from 6 kpc to 
12 kpc, is currently missing. Getting the dynamically-measured disk mass scale 
length (presumably over 2 to 3 scale lengths, i.e., an order of magnitude in S{R)) 
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will break the 'disk-halo degeneracy' and allow us to take strides towards better 
quantifying the amount of dark matter within Rq. In constraining the rotation 
curve, the emerging stellar surveys will eventually allow for testing less con- 
strained models, and the combination of stellar kinematics with good distances 
and the emerging results from the Galactic maser survey (Brunthaler et al. 2011) 
will be the most powerful constraints on global asymmetries of the Galactic po- 
tential, such a lopsidedness. This is because masers are the most feasible tracer 
of the rotation curve on the far side of the Galaxy. Untangling the disk and halo 
contributions to the potential at i? < 10 kpc will then permit tighter constraints 
on the shape of the dark matter and — ultimately — a test for the existence of 
dark matter (as opposed to alternative gravity laws) on the bases of the shape 
of the acceleration/potential map alone; such analyses probably require Gaia 
data, foremost to get very good proper motions to D ~ 10 kpc. 

7.4.4. Closing remarks 

In summary, an enormous amount of practical work needs to be carried out and 
conceptual modeling issues need to be sorted out, even before Gaia data are 
available. It seems crucial to do this work, even if some aspects (but clearly not 
all — abundances, radial velocities of faint stars) will be dramatically superseded 
by Gaia. This is because the currently emerging data will be key in making sure 
we are asking the right questions of the Gaia data, and think through what the 
most critical 'complementary data' to ESA's next flagship mission arc. In return, 
the field promises a information revolution that is probably unmatched in the 
field of galaxy studies in this decade, ALMA and JWST notwithstanding. 

But the attentive and persistent reader will have noticed that we have not 
fully closed the circle on using the Galactic Disk to test 'mechanisms of disk 
galaxy formation'. While Section 7.2 has sketched some specific examples of 
how to test aspects of disk galaxy formation by comparing ab initio simula- 
tions to the data, it has not tackled head-on some of the broad initial questions 
(Section 2.2): To which extent where stars born (vertically) hot or subsequently 
heated? Is there testable evidence that the feedback implemented in the simula- 
tions actually took place? Can we recognize stellar satellite debris well enough 
to quantify the fraction of Disk stars with external origin? 

That means that also much conceptual work needs to be done. It may serve 
as a useful compass to thoroughly think about the following scenario: if one 
had the perfectly 'analyzed and modeled' Gaia data (and those from all other 
surveys) at hand, what are the crispest inferences one could draw about how our 
Galaxy and other disk galaxies, have formed? 
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